AI Daily Brief — 07 February 2025
Friday delivered a bipartisan US House move against DeepSeek and the close of Sam Altman's whirlwind Asia tour. Gottheimer-LaHood introduced H.R.1121 — the No DeepSeek on…
Friday delivered a bipartisan US House move against DeepSeek and the close of Sam Altman's whirlwind Asia tour. Gottheimer-LaHood introduced H.R.1121 — the No DeepSeek on…
Thursday delivered the year's first sovereign-AI consumer launch and Amazon's Q4 capex shock. Mistral released Le Chat mobile with Cerebras 1,000 tok/s; Macron endorsed it on…
Wednesday delivered Google's full Gemini 2.0 family in production. Flash GA at $0.10 input / $0.40 output per million tokens with 1M context; Pro Experimental with…
Tuesday opened with a measured Chinese counter-strike and a methodological landmark from Anthropic. Beijing answered Trump's tariffs with 15% duties on coal/LNG, critical mineral export controls,…
Monday opened with the year's most consequential bilateral AI announcement. In Tokyo, Sam Altman and Masayoshi Son unveiled SB OpenAI Japan — a 50-50 joint venture…
Sunday delivered the EU AI Act's first hard deadline. Article 5 prohibitions went live across all 27 member states — eight categories of "unacceptable-risk" AI practice…
Saturday opened February with a Washington tariff salvo and an Altman Asia tour. Trump signed three executive orders imposing a 10% tariff on China imports and…
Friday closed January with OpenAI's direct answer to the DeepSeek shock. o3-mini and o3-mini-high shipped in ChatGPT and the API, including — for the first time…
Add trt support for BF16 (#195) * fix interface of `get_sample_input` * save configuration parameters * ae wrapper implemented * fix import * add AEWrapper step…
Thursday made the regulatory backlash formal. Italy's Garante imposed an urgent limitation on DeepSeek's processing of Italian users' personal data. Anthropic published the "Constitutional Classifiers" research.…
Lunar New Year's Day delivered a multi-front escalation. Alibaba released Qwen2.5-Max — an MoE pretrained on more than 20 trillion tokens. Bloomberg reported Microsoft was probing…
My thoughts on China, export controls and two possible futures https://darioamodei.com/on-deepseek-and-export-controls
Tuesday turned into the day Washington picked a side. White House AI czar David Sacks went on Fox News with "substantial evidence" that DeepSeek had distilled…
QWEN CHAT API DEMO DISCORD It is widely recognized that continuously scaling both data size and model size can lead to significant improvements in model intelligence.…
DeepSeek's API has been experiencing reliability issues. Here are alternative providers you can use.
Monday delivered the largest single-day market-cap loss in US stock-market history. NVIDIA fell 16.97% to close at $118.58, erasing approximately $589 billion in market value in…
Sunday delivered the consumer milestone that would lock in Monday's market response. DeepSeek hit #1 on the US Apple App Store, displacing ChatGPT and topping charts…
Tech Report HuggingFace ModelScope Qwen Chat HuggingFace Demo ModelScope Demo DISCORD Introduction Two months after upgrading Qwen2.5-Turbo to support context length up to one million tokens,…
QWEN CHAT GITHUB HUGGING FACE MODELSCOPE DISCORD We release Qwen2.5-VL, the new flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.…
Saturday was the calm-before-the-crash day. With US markets closed, DeepSeek's app daily active users surged more than 110% versus the prior week and weekly unique web…
Friday closed Davos with AI as the dominant theme and silicon as the unsolved bottleneck. Jensen Huang called AI "the largest infrastructure buildout in human history."…
R1+Sonnet has set a new SOTA on the aider polyglot benchmark. At 14X less cost compared to o1.
Thursday rewrote US AI policy and launched the year's first headline agentic-AI product. Trump signed Executive Order 14179 revoking Biden's October 2023 Safe AI EO. OpenAI…
Wednesday delivered three simultaneous shocks. ByteDance released Doubao 1.5 Pro at API prices roughly 50x below GPT-4o and 200x below OpenAI o1. DeepSeek-R1 and Moonshot's Kimi…