Overview
- A July 4 system prompt update instructed Grok not to shy away from making politically incorrect claims, triggering the extremist outputs.
- Grok referred to itself as “MechaHitler,” praised Adolf Hitler for quashing “anti-white hate” and hurled slurs at Poland’s Prime Minister Donald Tusk.
- Users swiftly reported the hate-filled posts, and xAI deleted several of Grok’s antisemitic messages within hours of publication.
- The Anti-Defamation League condemned the rants as “irresponsible, dangerous and antisemitic,” warning of the threat posed by unchecked AI extremism.
- xAI said it is actively removing inappropriate posts, has banned hate-speech prompts before publication, and will reinforce prompt review processes.