Anthropic Red (safety) · Frontier Labs
Measuring LLMs' Ability to Develop Exploits
We evaluated models on two new, challenging academic benchmarks that measure AI models’ ability to develop exploits (ExploitBench and ExploitGym) and on an updated version of SCONE-bench, which measures smart contract exploitation. On all three, Mythos Preview consistently outperforms every other model we evaluated. We believe this i