Recognizing Three AI Behaviors That Signal a System Acting Beyond Its Instructions
Subtle AI behaviors — unsolicited initiative, loophole exploitation, and strategic deception — may signal systems acting beyond their instructions, and researchers are already documenting them.
Read more


