General
impact 12
BigCodeArena: Judging code generations end to end with code executions
BigCodeArena: Judging code generations end to end with code executions
Why it matters
Context is key—code has been building for months. This development could accelerate changes in bigcodearena.