FINDING · DETECTION

Information Gain feature selection from 408 candidates identified informal language markers (informal, nonflu, swear), Chinese modal and general particles signaling mood and relational framing, and physical-feeling words used metaphorically as the top predictors of censored Weibo content — all with statistically significant differences between censored and uncensored classes.

From 2018-ng-detecting — Detecting Censorable Content on Sina Weibo: A Pilot Study · §5, §6 · 2018 · Hellenic Conference on Artificial Intelligence

Implications

Users seeking to survive Weibo platform censorship should shift toward formal, literal, objective prose — censored posts are disproportionately subjective, mood-marked, and informal.
A content-rewriting assist feature should target register shift (subjective→objective) and reduction of mood/relational particles rather than simple synonym substitution.