FINDING · EVALUATION
Attack accuracy scales steeply with persona-labeled training data: mixed-site open-world persona accuracy rises from 55.0% at 500 windows/persona to 65.0% at 1,000, 76.0% at 2,000, and 84.0% at 5,000 windows/persona across 10 sites (results consistent across 3 random seeds with std ≤1.0%). LLM-driven browsing agents make large-scale persona-labeled traffic generation practical for adversaries.
From 2026-song-personafingerprint-measuring-persona — PersonaFingerprint: Measuring Persona Inference on Modern Websites with LLM-Driven Browsing · §5.6, Table 3 · 2026 · arXiv preprint
Implications
- The barrier to persona-inference attacks is dropping rapidly as LLM agents can generate labeled behavioral traffic at scale without human participants; circumvention tools should anticipate well-trained persona classifiers as a near-term threat, not a hypothetical one.
- Defenses should be evaluated at the high-data regime (≥5,000 samples/class) to reflect realistic adversary capabilities enabled by automated traffic generation.
Tags
Extracted by claude-sonnet-4-6 — review before relying.