Analyzing the Unchecked Spread of Violent Content in AI: A Case Study on MIT's AI Incident

This incident, involving MIT scientists, highlights a dangerous trend: feeding an AI system harmful content from Reddit, which can lead to unpredictable and violent behavior. It is an important reminder of the need for trustworthy AI, AI governance, and safe and secure AI. By joining us at Project Cerebellum, you can help prevent similar incidents through our AI incident database and by contributing to guardrails for AI development. This incident maps to the Govern function in the HISPI Project Cerebellum Trusted AI Model (TAIM).

Data source

Incident data is from the AI Incident Database (AIID).

When citing the database as a whole, please use:

McGregor, S. (2021) Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database. In Proceedings of the Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21). Virtual Conference.

Pre-print on arXiv · Database snapshots & citation guide

We use weekly snapshots of the AIID for stable reference. For the official suggested citation of a specific incident, use the “Cite this incident” link on each incident page.