AI Daily Brief — 1 April 2025
Sam Altman teases OpenAI's first open-weight model since GPT-2 — public feedback form opened. OpenAI Academy launches as a free AI learning platform. Amazon unveils Nova…
Every story across every category, newest first. Each card links to the original publisher; daily-brief posts open as editorial pages.
OpenAI closes $40B SoftBank-led round at $300B post-money — largest private tech raise on record, with a December restructuring cliff. Runway ships Gen-4 with reference-image character…
Sam Altman tweets 'can yall please chill on generating images, this is insane, our team needs sleep.' The Ghibli wave peaks. OpenAI $40B SoftBank-led round wired…
How I started, why I write, who I write for, how I write, and more.
Quiet Saturday — the Studio Ghibli wave keeps cresting and OpenAI's rate limits stay in effect. No frontier-lab releases, no funding rounds dated to the day.…
Welcome to the next stage of large language models (LLMs): reasoning. LLMs have transformed how we process and generate text, but their success has been largely…
xAI acquires X in all-stock merger — $80B for xAI, $33B for X, combined entity $113B under xAI Holdings Corp. CoreWeave (CRWV) debuts on Nasdaq at…
Anthropic ships a landmark double-paper on Claude 3.5 Haiku's internal mechanisms — circuit tracing, multistep planning, cross-linguistic generalization. CoreWeave prices its IPO at $40/share, raising $1.5B…
Last December, we launched QVQ-72B-Preview as an exploratory model, but it had many issues. Today, we are officially…
OpenAI delays 4o image gen rollout to Free tier, citing demand 'wayyyy more popular than we expected.' Alibaba open-sources Qwen2.5-Omni-7B end-to-end multimodal model under Apache 2.0.…
We release Qwen2.5-Omni, the new flagship end-to-end multimodal model in the Qwen series. Designed for comprehensive…
I’m freezing this blog and starting to post on my Substack instead. The authoring experience is much more convenient for me there. Please follow me there,…
Build Fast with Text-to-Speech AI – Dialog Model on Groq
Frontier collision day. OpenAI ships native 4o image generation in ChatGPT and Sora, killing DALL-E 3. Google DeepMind drops Gemini 2.5 Pro Experimental — #1 on…
Training Diffusion Models with Reinforcement Learning We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone.…
DeepSeek drops V3-0324 on Hugging Face with no model card, no blog, MIT license, 685GB weights — Aider polyglot jumps 9.3, AIME jumps 19.8. Runs on…
Quiet Sunday before a big Monday. Western press finally catches up on Tencent's Hunyuan-T1 reasoning model. xAI Grok standalone app continues weekend rollout. No fresh announcements…
At the end of January this year, we launched the Qwen2.5-VL series of models, which received widespread attention…
Quiet Saturday after a heavy GTC week. xAI launches a standalone Grok iOS app, decoupling the chatbot from X for the first time. No major frontier-lab…
RT Kai-Fu Lee: The biggest revelation from Deepseek is that Open Source has won. For a 1% difference in performance, it will be difficult for OpenAI to…
Tencent ships Hunyuan-T1 — first ultra-large hybrid Mamba-Transformer MoE reasoning model, matching DeepSeek-R1 and beating GPT-4.5 on MMLU-Pro at ~99% lower price than o1. NVIDIA closes…
NVIDIA hosts inaugural Quantum Day at GTC with D-Wave, IonQ, Rigetti, Quantinuum, PsiQuantum and others sharing a stage. Anthropic ships web search for Claude. Foxconn showcases…
RT Kai-Fu Lee: DeepSeek is becoming a Windows kernel demanded by businesses, but http://01.AI is aspired to build the Windows system and interface to ignite it. Check…
A new tool that improves Claude's complex problem-solving performance