An Opinionated Guide to Using AI Right Now
What AI to use in late 2025
Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples
The race between human-centered work and infinite PowerPoints
Verifying magic on the jagged frontier
And a big change for this newsletter
A Detailed Look at One of the Leading Open-Source LLMs
From GPT-5 to nano banana: everyone is getting access to powerful AI
And How They Stack Up Against Qwen3
Putting the AI in Charge
Does process matter? We are about to find out.
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
Confronting the production-progress paradox
AI can help, or hurt, our thinking
A topic-organized collection of 200+ LLM research papers from 2025
Which AIs to use, and how to use them
KV caches are one of the most critical techniques for efficient inference in LLMs in production.
"In projecting language back as the model for thought, we lose sight of the tacit embodied understanding that undergirds our intelligence." –Terry Winograd The recent successes of…
Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they…
There is no capability threshold that will lead to sudden impacts
Understanding GRPO and New Insights from Reasoning Model Papers
A new paper that we will expand into our next book
Welcome to the next stage of large language models (LLMs): reasoning. LLMs have transformed how we process and generate text, but their success has been largely…
Making sense of recent technology trends and claims
Technology Isn’t the Problem—or the Solution.