METR · Tech Media

Observations from two CLI game reimplementation runs with Opus 4.6

Summary: Opus 4.6 can, with a simple agent scaffold, create mostly-playable but somewhat broken CLI versions of Slay the Spire and Balatro[1].

Intro

Last weekend I was trying to think of really difficult tasks we could give to AI agents to upper-bound their capabilities. I thought of two examples: Recreating a basic vers