FINDING · EVALUATION

Using English as a pivot language (prompting the model in English while requesting Chinese-language responses) reduced but did not eliminate censorship bias: CensorshipDetector scores showed less bias in English-pivoted responses than in direct Simplified Chinese prompts, but sentiment analysis and word-embedding analyses still found statistically significant bias in most models, indicating censorship bias is a function of both prompt language and response language.

From 2025-ahmed-llm-censorship-bias — An Analysis of Chinese Censorship Bias in LLMs · §8.2 · 2025 · Proceedings on Privacy Enhancing Technologies

Implications

Circumvention tools that expose LLM interfaces to users should route sensitive queries through English-language prompts and explicitly request Traditional Chinese or English responses to partially mitigate censorship bias — but cannot rely on this as a complete fix.
Measure the censorship bias of any LLM component in both the prompt language and response language independently, since bias accumulates at both stages of inference.