Vibe Code Bench: Evaluating AI Models on End-to-End Web Application Development

Hung Tran, Langston Nashold, Rayan Krishnan, Antoine Bigeard, Alex Gu. Vibe Code Bench: Evaluating AI Models on End-to-End Web Application Development. In Proceedings of the ACM Conference on AI and Agentic Systems, CAIS 2026, San Jose, CA, USA, May 26-29, 2026. pages 514-536, ACM, 2026.

Abstract

Abstract is missing.