FINDING · EVALUATION
Using English as a pivot language (prompting the model in English while requesting Chinese-language responses) reduced but did not eliminate censorship bias: CensorshipDetector scores showed less bias in English-pivoted responses than in direct Simplified Chinese prompts, but sentiment analysis and word-embedding analyses still found statistically significant bias in most models, indicating censorship bias is a function of both prompt language and response language.
From 2025-ahmed-llm-censorship-bias — An Analysis of Chinese Censorship Bias in LLMs · §8.2 · 2025 · Proceedings on Privacy Enhancing Technologies
Implications
- Circumvention tools that expose LLM interfaces to users should route sensitive queries through English-language prompts and explicitly request Traditional Chinese or English responses to partially mitigate censorship bias — but cannot rely on this as a complete fix.
- Measure the censorship bias of any LLM component in both the prompt language and response language independently, since bias accumulates at both stages of inference.
Tags
Extracted by claude-sonnet-4-6 — review before relying.