Scaling How We Build and Test Our Most Advanced AI
Every primary-source story across every tracked model. Filter by clicking a chip.
Merge pull request #181 from lumalabs/release-please--branches--main--changes--next release: 1.21.0
Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners…
Merge branch 'main' of https://github.com/zai-org/GLM-V
How Alta Daily Uses Meta’s Segment Anything to Reimagine the Digital Closet
Hint: it's not benchmark scores.
The Batch AI News and Insights: Voice-based AI that you can talk to is improving rapidly, yet most people still don’t appreciate how pervasive voice UIs…
[skill] glmv-stock-analyst (#263) * add stock analyst Signed-off-by: JaredforReal * rename Signed-off-by: JaredforReal * update Signed-off-by: JaredforReal --------- Signed-off-by: JaredforReal
OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier, DLSS 5 looks like a real-time generative AI filter for video…
We often lack the tools for the job, even if the AI is capable enough
New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others
Merge pull request #262 from zai-org/skills_format format with skills
Merge pull request #261 from JaredforReal/skills [Skill] add prd-to-app and web-replication skill
upgrade web-replication skill.md Signed-off-by: JaredforReal
add prd-to-app and web-replication Signed-off-by: JaredforReal
[skill] add pdf skills && add doc-besed-writing && change DEFAULT_MODEL to glm-5v-turbo (#260) * add pdf skills and doc-based-writing and change to glm-5v-turbo Signed-off-by: JaredforReal *…
feat(internal): implement indices array format for query and form serialization
The Batch AI News and Insights: The anti-AI coalition continues to maneuver to find arguments to slow down AI progress.
SAM 3.1: Faster and More Accessible Real-Time Video Detection and Tracking With Multiplexing and Global Reasoning
Update: Further details on this exercise are included in our Frontier Risk Report (February-March 2026), within the Anthropic section of Appendix B. In collaboration with Anthropic,…
Human-Computer Interaction and Visualization
add glmv-grounding skill (#259) Signed-off-by: JaredforReal
chore(ci): skip lint on metadata-only changes Note that we still want to run tests, as these depend on the metadata.
We track 28 AI models across text, image, video, and audio domains. Each model has its own filtered news feed — click a chip above to see only that model's primary-source coverage. Tagging is automatic at ingest time using a strict title-only keyword match (so a benchmark post that mentions five models in its summary only shows up under whichever model is named in the headline — no thin-content drift).
Text / LLMs: GPT, Claude, Gemini, Gemma, Llama, Mistral, Grok, Qwen, DeepSeek, Kimi, GLM, MiniMax, Yi, Hunyuan, Command, Phi.
Image generation: FLUX, Stable Diffusion, Midjourney, Imagen.
Video generation: Sora, Veo, Runway, Luma, Kling, Pika.
Audio / voice: ElevenLabs, Suno.
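The title-only tagging rule described above can be sketched roughly as follows. This is a minimal illustration, not the site's actual ingest code: the `MODEL_KEYWORDS` table, the `tag_item` function, and the boundary-matching regex are all assumptions made for the example, and only a few of the tracked models are listed.

```python
import re

# Hypothetical keyword table (an assumption): model chip -> keywords to look for.
# Only a handful of the 28 tracked models are shown here.
MODEL_KEYWORDS = {
    "Claude": ["claude"],
    "GPT": ["gpt"],
    "Gemini": ["gemini"],
    "Llama": ["llama"],
}


def tag_item(title, summary="", keywords=MODEL_KEYWORDS):
    """Return the model tags for a feed item, looking at the title only.

    The summary is accepted but deliberately ignored, so a post that
    mentions several models in its body is tagged only with the models
    named in its headline.
    """
    tags = []
    for model, words in keywords.items():
        for word in words:
            # Assumed matching rule: case-insensitive, and the keyword must
            # not be glued to surrounding letters (so "gpt" matches "GPT-5.4"
            # but would not match inside a longer word).
            pattern = rf"(?<![A-Za-z0-9]){re.escape(word)}(?![A-Za-z])"
            if re.search(pattern, title, re.IGNORECASE):
                tags.append(model)
                break  # one hit per model is enough
    return tags


if __name__ == "__main__":
    print(tag_item("OpenAI ships GPT-5.4 mini and nano"))
    print(tag_item("Hint: it's not benchmark scores.",
                   summary="We compare Claude, Gemini, and Llama."))
```

Under these assumptions, the first item is tagged with GPT only, and the second item gets no tags at all even though its summary names three models, which is the "no thin-content drift" behavior the description refers to.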