FINDING · EVALUATION
CenDTect uses cross-classification accuracy — how well a decision tree trained on one domain's blocking pattern predicts another domain's blocking — as a distance metric to cluster domains that share the same blocking policy. This metric outperforms prior time-series approaches because it is interpretable (the resulting decision tree directly reveals the blocking mechanism: which ISP, which port, which protocol) rather than producing opaque anomaly scores. The approach scales to planetary-measurement volumes without requiring labelled training data.
From 2024-tsai-modeling — Modeling and Detecting Internet Censorship Events · §3 (CenDTect Design), §5.3 · 2024 · Network and Distributed System Security
Implications
- When building automated monitoring of circumvention server reachability, model blocking patterns as decision-tree classifiable features (port, protocol, ASN, time-of-day) rather than purely time-series anomalies — this produces actionable blocking-rule hypotheses rather than raw anomaly alerts.
- Heterogeneous blocking policies across ISPs within the same country are a confirmed, measurable phenomenon; circumvention tools should not assume a single national policy and must test each major ISP/AS independently.
Tags
Extracted by claude-sonnet-4-6 — review before relying.