Routine safety training can largely neutralize such simple backdoors.