Science ❯ Computer Science ❯ Artificial Intelligence
Bias in AI AI Safety Risk Assessment Explainable AI Safety and Alignment Behavioral Control Human Impact Misuse of AI Value Alignment Motivated Reasoning Fairness in AI Data Compliance Safety Concerns Public Benefit Corporations AI Limitations
The revision elevates misalignment plus persuasion into formal risk thresholds with mandatory safety reviews before release.