Routine safety training can largely neutralize such simple backdoors.