Meta Platforms Users Report Being Wrongfully Locked Out After Purported AI Moderation Flags Accounts for Child Exploitation Content

July 2, 2025

Users of Instagram, Facebook, and WhatsApp report being wrongfully locked out of their accounts after AI moderation purportedly flagged them for child exploitation content. The affected users maintain their innocence and face the loss of personal and professional data. Experts suggest that over-reliance on AI moderation may be at fault. Meta has yet to explain or reverse these suspensions. Readers interested in responsible AI governance and in safe, secure AI practices can join Project Cerebellum, which is working toward guardrails for AI, including the HISPI Project Cerebellum TAIM (Govern) function.

Read more about this incident and others in our AI incident database, dedicated to harm prevention through Project Cerebellum.

Matched TAIM controls

Suggested mapping from embedding similarity (not a formal assessment).

MAP 1.6: similarity 0.671, rank 1
MEASURE 2.10: similarity 0.668, rank 2
MANAGE 4.3: similarity 0.667, rank 3

Alleged deployer: meta
Alleged developer: meta
Alleged harmed parties: meta-users, instagram-users, facebook-users, whatsapp-users

Source

Data from the AI Incident Database (AIID). Cite this incident: https://incidentdatabase.ai/cite/1219

When citing the database as a whole, please use:

McGregor, S. (2021) Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database. In Proceedings of the Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21). Virtual Conference.

We use weekly snapshots of the AIID for stable reference. For the official suggested citation of a specific incident, use the “Cite this incident” link on each incident page.