AI & ML
impact 16
Evaluating Answer Leakage Robustness of LLM Tutors against Adversarial Student Attacks
Evaluating Answer Leakage Robustness of LLM Tutors against Adversarial Student Attacks arXiv:2604.18660v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used in education, yet their default…
Why it matters
This adds a new dimension to the evaluating conversation. Practitioners should assess exposure to answer changes.