Anthropic Red (safety) May 22, 2026 · Frontier Labs

Measuring LLMs' Ability to Develop Exploits

On two new, challenging academic benchmarks measuring AI models’ ability to develop exploits (ExploitBench and ExploitGym) and an updated version of the benchmark measuring smart contract exploitation (SCONE-bench), we have found that Mythos Preview consistently outperforms all other evaluated models. We believe this i

Read original