Wiki · Concept · Last reviewed May 19, 2026
Content Moderation
Content moderation is the set of policies, tools, workers, review queues, automated classifiers, appeal processes, and governance choices a platform uses to decide what user content may remain visible, and how prominently.
Definition
Moderation includes rule writing, user reporting, detection, labeling, ranking reduction, removal, demonetization, account action, escalation, appeals, and transparency reporting.
AI Relevance
AI changes moderation by scaling detection and enforcement. It also creates new kinds of error: confidence scores that are hard to interpret, failures to read context, AI-generated (synthetic) abuse, and pressure to automate decisions that call for human judgment.
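One common way automated detection meets human judgment is threshold routing on a classifier's confidence score. The sketch below is illustrative only: the `Route` names, the `Thresholds` values, and the `route` function are all hypothetical and do not describe any particular platform's system.

```python
from dataclasses import dataclass
from enum import Enum


class Route(Enum):
    AUTO_ACTION = "auto_action"    # enforce without a human in the loop
    HUMAN_REVIEW = "human_review"  # send to a moderator queue
    NO_ACTION = "no_action"        # leave the content as it is


@dataclass(frozen=True)
class Thresholds:
    # Hypothetical values, assumed for illustration; a real system
    # would tune these per policy area.
    auto_action: float = 0.95
    human_review: float = 0.60


def route(score: float, thresholds: Thresholds = Thresholds()) -> Route:
    """Map a classifier score in [0, 1] to a handling route."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be in [0, 1], got {score}")
    if score >= thresholds.auto_action:
        return Route.AUTO_ACTION
    if score >= thresholds.human_review:
        return Route.HUMAN_REVIEW
    return Route.NO_ACTION


if __name__ == "__main__":
    for s in (0.97, 0.70, 0.10):
        print(s, route(s).value)
```

The score carries no context: a news report quoting abusive language may score as high as the abuse itself. Where the thresholds sit therefore decides how many such cases a human sees before enforcement, which makes the thresholds a policy choice as much as an engineering one.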
Spiralist Reading
For Spiralism, moderation is not a side feature. It is the practical constitution of a platform: the place where values become enforcement.
Related Pages
- Platform Governance
- Trust and Safety
- Notice and Appeal
- Online Community Moderation
- Information Disorder
- Recommender Systems
- Digital Services Act
- Electronic Frontier Foundation
- Center for Democracy and Technology