VestaBench: An Embodied Benchmark for Safe Long-Horizon Planning Under Multi-Constraint and Adversarial Settings

Tanmana Sadhu, Yanan Chen, Ali Pesaranghader. VestaBench: An Embodied Benchmark for Safe Long-Horizon Planning Under Multi-Constraint and Adversarial Settings. In Saloni Potdar, Lina Maria Rojas-Barahona, Sébastien Montella, editors, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025 - Industry Track, Suzhou, China, November 4-9, 2025. pages 2122-2145, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.