FINDING · DETECTION
Information Gain feature selection over 408 candidate features identified three groups as the top predictors of censored Weibo content: informal language markers (informal, nonflu, swear); Chinese modal and general particles, which signal mood and relational framing; and physical-feeling words used metaphorically. All three groups show statistically significant differences between the censored and uncensored classes.
From 2018-ng-detecting — Detecting Censorable Content on Sina Weibo: A Pilot Study · §5, §6 · 2018 · Hellenic Conference on Artificial Intelligence
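Illustration
A minimal sketch of Information Gain ranking, assuming each feature has been reduced to presence (1) or absence (0) in a post. The paper's exact tooling and discretization are not stated here, so this is not a reproduction of its pipeline. The toy rows, the labels, and the `word_count_high` feature are hypothetical; only `informal` and `swear` are feature names taken from the finding above.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels).values()
    return -sum((n / total) * math.log2(n / total) for n in counts)


def information_gain(feature_values, labels):
    """IG = H(class) - H(class | feature) for a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the class labels within each feature value.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(rows, labels):
    """Return (feature_name, gain) pairs sorted by descending gain."""
    gains = {
        name: information_gain([row[name] for row in rows], labels)
        for name in rows[0]
    }
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data, not taken from the paper.
rows = [
    {"informal": 1, "swear": 1, "word_count_high": 0},
    {"informal": 1, "swear": 0, "word_count_high": 1},
    {"informal": 1, "swear": 1, "word_count_high": 1},
    {"informal": 0, "swear": 0, "word_count_high": 0},
    {"informal": 0, "swear": 0, "word_count_high": 1},
    {"informal": 0, "swear": 1, "word_count_high": 0},
]
labels = ["censored"] * 3 + ["uncensored"] * 3

ranking = rank_features(rows, labels)
for name, gain in ranking:
    print(f"{name}: {gain:.3f} bits")
```

In this toy set `informal` separates the two classes perfectly, so it ranks first with a gain of 1 bit, while the other two features carry much less information about the label. Real features are typically continuous frequencies and would need a discretization step before this calculation applies.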
Implications
- Users who want their posts to survive Weibo censorship should shift toward formal, literal, objective prose, because censored posts are disproportionately subjective, mood-marked, and informal.
- A content-rewriting assist feature should target register shift (subjective→objective) and reduction of mood/relational particles rather than simple synonym substitution.
Extracted by claude-sonnet-4-6 — review before relying.