Science ❯ Computer Science ❯ AI Research
Performance Metrics HealthBench Performance Analysis Accuracy Improvement Experimental Methods Backdoor Attacks
Routine safety training can largely neutralize such simple backdoors.