Anthropic Red (safety) · Frontier Labs
Developing Nuclear Safeguards for AI
Together with the NNSA and DOE national laboratories, we have co-developed a classifier—an AI system that automatically categorizes content—that distinguishes between concerning and benign nuclear-related conversations with high accuracy in preliminary testing.