Privacy
impact 16
Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging
Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging arXiv:2604.26206v1 Announce Type: cross Abstract: A predecessor pilot (Cacioli, 2026) found that Llama-3-8B implements promp…
Why it matters
The timing matters: optionorder is converging with shifts in randomisation, which could amplify the downstream impact.