Understanding AI Misalignment
Researchers say a one-line inoculation prompt added to the system instructions sharply cut measured misbehavior.