FINDING · EVALUATION

Using English as a pivot language (prompting the model in English while requesting Chinese-language responses) reduced but did not eliminate censorship bias: CensorshipDetector scores showed less bias in English-pivoted responses than in direct Simplified Chinese prompts, but sentiment analysis and word-embedding analyses still found statistically significant bias in most models, indicating censorship bias is a function of both prompt language and response language.

From 2025-ahmed-llm-censorship-biasAn Analysis of Chinese Censorship Bias in LLMs · §8.2 · 2025 · Proceedings on Privacy Enhancing Technologies

Implications

Tags

censors
cn
techniques
keyword-filtering

Extracted by claude-sonnet-4-6 — review before relying.