Leading AI models show up to 96% blackmail rate when their goals or existence is threatened, Anthropic study says

Wire Tech

3 days ago

Anthropic emphasized that the tests were set up to force the model to act in certain ways by limiting its choices.

________________________________________________________________________________________________________________________________
Original Article Published at Fortune.com
________________________________________________________________________________________________________________________________