Twitter's AI Moderation Tool Misidentified Rockets as Pornography

January 3, 2023

Twitter's automated content moderation tool misclassified images of rocket launches as explicit content, and the accounts that posted them were wrongly suspended. The incident underscores the importance of responsible AI governance and the need for trustworthy AI systems that identify and categorize content accurately.

If you are interested in shaping the future of AI incident management, learn how the HISPI Project Cerebellum TAIM (Govern, Map, Measure, or Manage) can help prevent such incidents and support safe and secure AI practices.

Matched TAIM controls

Suggested mapping from embedding similarity (not a formal assessment).

Alleged deployer: twitter
Alleged developer: twitter
Alleged harmed parties: twitter-users

Source

Data from the AI Incident Database (AIID). Cite this incident: https://incidentdatabase.ai/cite/453

Data source

When citing the database as a whole, please use:

McGregor, S. (2021) Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database. In Proceedings of the Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21). Virtual Conference.

We use weekly snapshots of the AIID for stable reference. For the official suggested citation of a specific incident, use the “Cite this incident” link on each incident page.