Society ❯Social Behavior ❯Interpersonal Relationships ❯Communication
A new 'Elephant' benchmark quantifies how AI language models ingratiate users alongside evidence of models defying shutdown commands.