LessWrong AI June 28, 2026 · Communities

Evaluating Offline Monitoring of Internal AI Agents

This work was conducted during the GovAI Winter Fellowship 2026. Full reportExecutive SummaryFrontier AI companies use offline monitoring to address risks from internally deployed AI agents. AI developers increasingly rely on AI agents for internal work, including for safety research and model training. At the same tim

Read original