Skip to content
Eugene Yan · Tech Media

Patterns for Building Cybersecurity Evals

A sandboxed target, inputs that influence task difficulty, tools, and a grader.