LWiAI Podcast #237 – Nemotron 3 Super, xAI reborn, Anthropic Lawsuit, Research!
Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning, Another XAI Cofounder Has Left, Anthropic Sues Department of Defense
Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning, Another XAI Cofounder Has Left, Anthropic Sues Department of Defense
Anthropic sues Trump administration in AI dispute with Pentagon, ‘Not built right the first time’ — Musk’s xAI is starting over again, again, Cascade of A.I.…
We reviewed two versions of Anthropic’s Sabotage Risk Report for Claude Opus 4.6, producing two corresponding review documents: our review of the February 11 version and…
Anthropic officially told by DOD that it’s a supply chain risk, ‘cancel ChatGPT’ trend is growing after OpenAI signs a deal with the US military, and…
RT AnthropicA statement from Anthropic CEO Dario Amodei: https://www.anthropic.com/news/where-stand-department-war
Evaluating Opus 4.6 on BrowseComp, we found cases where the model recognized the test, then found and decrypted answers to it—raising questions about eval integrity in…
This post dives deep into how Claude wrote an exploit for one of the vulnerabilities it found in Firefox.
The Batch AI News and Insights: I’m thrilled to announce Context Hub, a new tool to give to your coding agents the API documentation they need…
Anthropic releases Sonnet 4.6, Google Rolls Out Gemini 3.1 Pro, Anthropic CEO Amodei says Pentagon’s threats ‘do not change our position’ on AI
Summary: Opus 4.6 can, with a simple agent scaffold, create mostly-playable but somewhat broken CLI versions of Slay the Spire and Balatro1. Intro Last weekend I…
RT AnthropicA statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War.https://www.anthropic.com/news/statement-department-of-war
Anthropic releases Sonnet 4.6, Google Rolls Out Latest AI Model Gemini 3.1 Pro, Pentagon threatens to cut off Anthropic in AI safeguards dispute
Claude Sonnet 4.6 is now available in Windsurf with limited-time promotional pricing for self serve users: 2x credits without thinking and 3x credits with thinking.
An action-packed episode!
A crazy packed edition of Last Week in AI! Plus some small updates.
Ollama now supports subagents and web search in Claude Code.
Most of METR’s time horizon measurements are done using two scaffolds: Triframe and ReAct1. People sometimes see that we use these two scaffolds and feel skeptical…
The Batch AI News and Insights: I recently spoke at the Sundance Film Festival on a panel about AI.
Use the Pinecone Plugin for Claude Code to develop AI Applications Faster
Claude Opus 4.6 (fast mode) is now available in Windsurf with limited-time promotional pricing for self serve users: 10x credits without thinking and 12x credits with…
Claude Opus 4.6 is now available in Windsurf with limited-time promotional pricing for self serve users: 2x credits without thinking and 3x credits with thinking. Available…
AnnouncementsIntroducing Claude Opus 4.6Feb 5, 2026We`re upgrading our smartest model.The new Claude Opus 4.6 improves on its predecessor`s coding skills. It plans more carefully, sustains agentic…
Infrastructure configuration can swing agentic coding benchmarks by several percentage points—sometimes more than the leaderboard gap between top models.nn
We tasked Opus 4.6 using agent teams to build a C Compiler, and then (mostly) walked away. Here's what it taught us about the future of…