Eugene Yan
· Tech Media
Patterns for Building Cybersecurity Evals
A sandboxed target, inputs that influence task difficulty, tools, and a grader.
A sandboxed target, inputs that influence task difficulty, tools, and a grader.