Anthropic Red (safety) August 21, 2025 · Frontier Labs

Developing Nuclear Safeguards for AI

Together with the NNSA and DOE national laboratories, we have co-developed a classifier—an AI system that automatically categorizes content—that distinguishes between concerning and benign nuclear-related conversations with high accuracy in preliminary testing.
