Model Behavior Part 2

News

Cryptopolitan on MSN

OpenAI reorganizes teams, merging Model Behavior with Post Training

OpenAI has announced plans to reshuffle its Model Behavior team. The team is a group of researchers that shapes how the ...

AI models know when they're being tested - and change their behavior, research shows

New joint safety testing from UK-based nonprofit Apollo Research and OpenAI set out to reduce secretive behaviors like scheming in AI models. What researchers found could complicate promising ...

Wired

Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

Anthropic’s alignment team was doing routine safety testing in the weeks leading up to the release of its latest AI models when researchers discovered something unsettling: When one of the models ...

Computerworld

OpenAI unveils ‘Model Spec’: A framework for shaping responsible AI

In a bid to improve accountability and transparency in AI development, OpenAI has released a preliminary draft of “Model Spec.” This first-of-its-kind document outlines the principles guiding model ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results