Skip to content
AI Snake Oil (Narayanan) · Newsletters

New paper: AI agents that matter

Rethinking AI agent benchmarking and evaluation