Privacy impact 16

Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging

arXiv:2604.26206v1. Announce Type: cross.

Abstract: A predecessor pilot (Cacioli, 2026) found that Llama-3-8B implements promp…

Why it matters

The timing matters: work on option order is converging with shifts in randomisation practice, which could amplify the downstream impact.

Read full article at arXiv AI →
