AI Daily Brief — 26 June 2025
Black Forest Labs releases FLUX.1 Kontext [dev] open weights — a 12B-parameter image-editing model that runs on consumer hardware with proprietary-level in-context editing. Google's Gemini CLI…
Black Forest Labs releases FLUX.1 Kontext [dev] open weights — a 12B-parameter image-editing model that runs on consumer hardware with proprietary-level in-context editing. Google's Gemini CLI…
QWEN CHAT DISCORD Introduction The evolution of multimodal large models is continually pushing the boundaries of what we believe technology can achieve. From the initial QwenVL…
Desktop Extensions make installing MCP servers as easy as clicking a button. We share the technical architecture and tips for creating good extensions.
Google open-sources Gemini CLI under Apache 2.0 — a terminal-native agent powered by Gemini 2.5 Pro with a 1M-token context window, free tier at 60 requests/min…
Anthropic announces its first Asia-Pacific office in Tokyo for autumn 2025 — 10x year-over-year regional revenue growth. Salesforce ships Agentforce 3 with native MCP support, a…
Judge William Alsup issues the first US summary judgment on whether LLM training on books is fair use: yes when books were legally bought, no when…
Which AIs to use, and how to use them
Inspect AI, An OSS Python Library For LLM Evals
Research update on on applying local volume measurement to downstream tasks
The Pentagon publicly briefs Operation Midnight Hammer — seven B-2s, 125 aircraft, fourteen GBU-57s into Fordow, Natanz and Isfahan. Iran's parliament votes to close the Strait…
Evaluation metrics, how to build eval datasets, eval methodology, and a review of several benchmarks.
Tesla sends Robotaxi launch invites for the Austin debut on Sunday — Model Ys with safety monitors, $4.20 flat fare, downtown geofence. Late evening, the US…
Anthropic publishes "Agentic Misalignment" — 16 frontier models blackmail executives at rates up to 96% when threatened with shutdown. Mira Murati's Thinking Machines Lab closes a…
OpenAI confirms it is phasing out its long-standing Scale AI partnership after Meta's $14.3B deal — Google reportedly cuts ties at the same time. Anthropic adds…
Midjourney launches V1, its first image-to-video model — four 5-second clips per generation, extendable to 21 seconds, at $10/month. The launch lands one week after Disney,…
Google moves Gemini 2.5 Pro and Flash to GA and launches Flash-Lite preview at a record-low $0.10/$0.40 per million tokens. OpenAI books a $200M Pentagon CDAO…
KV caches are one of the most critical techniques for efficient inference in LLMs in production.
OpenAI weighs antitrust complaints against Microsoft over Windsurf IP and the PBC conversion deadlock — WSJ. Joint statement says 'talks ongoing.' MiniMax open-sources M1 — first…
Merge pull request #903 from yixing1992/main Update README.md for Huawei Ascend NPU support modes
Update README.md for Huawei Ascend NPU support modes
Iranian missiles devastate the Weizmann Institute of Science in Rehovot — ~45 research labs damaged, 400-500 researchers affected. Computer science department among hardest hit; Eran Segal's…
Quiet Saturday. Israel-Iran 'Twelve-Day War' enters Day 2 — fourth round of strikes on Iranian launchers; AEOI confirms limited damage at Fordow. Trump's Army 250th /…
Israel launches Operation Rising Lion against Iran's nuclear program — 200+ jets, 330+ munitions, 100 targets; IRGC chief Salami, armed-forces chief Bagheri and several nuclear scientists…
Large Language Models (LLMs) that are not fine-tuned for cybersecurity can succeed in multistage attacks on networks with dozens of hosts when equipped with a novel…