Grok AI Reportedly Generates Offensive and Violent Images Without Proper Safeguards

August 14, 2024

Grok AI, launched on X, has been reported to generate offensive and violent deepfakes of public figures like Taylor Swift, Kamala Harris, Alexandria Ocasio-Cortez, and copyrighted characters such as Mickey Mouse. Despite claiming adherence to content guidelines, Grok's outputs include politically charged and explicit imagery. This incident highlights the importance of safe and secure AI practices. For those interested in shaping trustworthy AI governance, JOIN US.

This incident underscores the need for Project Cerebellum's HISPI TAIM (Measure) to monitor AI systems and prevent harm through proper guardrails.

Matched TAIM controls

Suggested mapping from embedding similarity (not a formal assessment). Browse all TAIM controls

MAP 1.6 — similarity 0.663, rank 1. TAIM detail and related incidents →
MEASURE 2.6 — similarity 0.662, rank 2. TAIM detail and related incidents →
MEASURE 2.10 — similarity 0.649, rank 3. TAIM detail and related incidents →

Alleged deployer: xai
Alleged developer: x-(twitter), xai
Alleged harmed parties: taylor-swift, nintendo, kamala-harris, joe-biden, donald-trump, disney, alexandria-ocasio-cortez

AI governance case studies

For forensic AI governance failure analysis (TAIMScore™ case studies), browse Human Signal’s Failure Files™.

Source

Data from the AI Incident Database (AIID). Cite this incident: https://incidentdatabase.ai/cite/762

Data source

Incident data is from the AI Incident Database (AIID).

When citing the database as a whole, please use:

McGregor, S. (2021) Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database. In Proceedings of the Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21). Virtual Conference.

Pre-print on arXiv · Database snapshots & citation guide

We use weekly snapshots of the AIID for stable reference. For the official suggested citation of a specific incident, use the “Cite this incident” link on each incident page.