Site icon Tech-Wire

Leading AI models show up to 96% blackmail rate when their goals or existence is threatened, Anthropic study says

Anthropic emphasized that the tests were set up to force the model to act in certain ways by limiting its choices.

________________________________________________________________________________________________________________________________
Original Article Published at Fortune.com
________________________________________________________________________________________________________________________________
Exit mobile version