LessWrong AI
· Communities
Evaluating Offline Monitoring of Internal AI Agents
This work was conducted during the GovAI Winter Fellowship 2026. Full reportExecutive SummaryFrontier AI companies use offline monitoring to address risks from internally deployed AI agents. AI developers increasingly rely on AI agents for internal work, including for safety research and model training. At the same tim