News

OpenAI has announced plans to reshuffle its Model Behavior team, a group of researchers that shapes how the ...
New joint safety testing from UK-based nonprofit Apollo Research and OpenAI set out to reduce secretive behaviors like scheming in AI models. What researchers found could complicate promising ...
Anthropic’s alignment team was doing routine safety testing in the weeks leading up to the release of its latest AI models when researchers discovered something unsettling: When one of the models ...
In a bid to improve accountability and transparency in AI development, OpenAI has released a preliminary draft of “Model Spec.” This first-of-its-kind document outlines the principles guiding model ...