Eugene Yan
· Tech Media
Product Evals in Three Simple Steps
Label some data, align LLM-evaluators, and run the eval harness with each change.