fix up readme.md
fix up readme.md Signed-off-by: qingjun
Every story across every category, newest first. Each card links to the original publisher; daily-brief posts open as editorial pages.
fix up readme.md Signed-off-by: qingjun
Update the vllm_deployment_guild_cn.md and vllm_deployment_guild.md files to explicitly specify the model names as MiniMax-Text-01 and MiniMax-VL-01, and revise the related descriptions to improve the accuracy and…
Benchmark results for Qwen3 models using the Aider polyglot coding benchmark.
Mistral launches Medium 3 — claimed ≥90% of Claude 3.7 Sonnet at $0.40/$2.00 per M tokens — plus Le Chat Enterprise. OpenAI names Instacart CEO Fidji…
The $6.32 benchmark cost reported for Gemini 2.5 Pro Preview 03-25 was incorrect.
OpenAI agrees to acquire Windsurf for ~$3B — its largest acquisition to date. NVIDIA + ServiceNow unveil Apriel Nemotron 15B open-source reasoning model for enterprise agents.…
更新 vllm_deployment_guild_cn.md 和 vllm_deployment_guild.md 文件,修改模型名称为 MiniMax 系列,新增 MiniMax-VL-01 模型获取说明及相关命令。 Signed-off-by: qingjun
OpenAI reverses course on for-profit conversion. The nonprofit will retain control; the for-profit LLC converts to a Public Benefit Corporation. Bret Taylor publishes board chair letter;…
Quiet Sunday. The dominant AI-adjacent story is the White House's AI-generated Trump-as-Jedi Star Wars Day post — fans noting the red lightsaber canonically belongs to Sith…
Learning to automate simple agentic workflows with Amazon Q CLI, Anthropic MCP, and tmux.
Quiet Saturday. Industry digests Qwen3, DeepSeek Prover-V2, Llama 4 and the OpenAI sycophancy postmortem. The AI Diffusion Rule is still scheduled to take effect May 15;…
OpenAI publishes the long-form sycophancy postmortem 'Expanding on what we missed' — explicit sycophancy evaluation will be added to launch process; behavioural problems will be launch-blocking;…
Anthropic ships Integrations (10+ partners) and 45-minute Research mode. OpenAI fully rolls back the GPT-4o sycophancy update. ChatGPT shopping search rolls out worldwide — 1B+ web…
There is no capability threshold that will lead to sudden impacts
Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute (Graves et al. 2016, Ling,…
DeepSeek releases Prover-V2 in 7B and 671B-MoE variants. 88.9% pass-rate on MiniF2F-test SOTA. Visa unveils Intelligent Commerce with Anthropic, IBM, Microsoft, Mistral, OpenAI, Perplexity, Samsung and…
Meta LlamaCon: Llama API preview, standalone Meta AI app on Llama 4, Llama Guard 4 + Prompt Guard 2 + LlamaFirewall + CyberSecEval 4, Cerebras serves…
Nope what’s that?Isa Fulford: me at the iclr openai recruiting event: random man:have you heard of arxiv?
Official Llama API Now Fastest via Groq Inference
Alibaba launches Qwen3 — eight open-source hybrid-reasoning models (six dense 0.6B-32B, two MoE: 30B-A3B and flagship 235B-A22B) under Apache 2.0. First lab to ship togglable 'thinking…
QWEN CHAT GitHub Hugging Face ModelScope Kaggle DEMO DISCORD Introduction Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen…
Sam Altman publicly acknowledges GPT-4o has become 'too sycophant-y and annoying.' Fixes promised that day and through the week. Otherwise a quiet Sunday — Qwen3 launches…
Quiet Saturday. The GPT-4o sycophancy update from Friday is live in ChatGPT and the viral screenshots start spreading. Alibaba's Qwen3 family is two days away. No…
Manus AI (Butterfly Effect) raises $75M Series B led by Benchmark at $500M valuation — 5x its prior price. US VC backing a Chinese AI agent…