How to Protect Paper Models

OpenAI can rehabilitate AI models that develop a “bad-boy persona”

Researchers at the company looked into how malicious fine-tuning makes a model go rogue, and how to turn it back. A new paper from OpenAI has shown why a little bit of bad training can make AI models ...

TechCrunch

Research leaders urge tech industry to monitor AI’s ‘thoughts’

AI researchers from OpenAI, Google DeepMind, Anthropic, and a broad coalition of companies and nonprofit groups are calling for deeper investigation into techniques for monitoring the so-called ...

TechCrunch

OpenAI’s research on AI models deliberately lying is wild

Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated multiple universes exist. Or when Anthropic gave its AI ...
