Xiuyuan Chen, Yuan Lin, Yuchen Zhang, Weiran Huang. AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering. In Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol, editors, Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXVIII. Volume 15126 of Lecture Notes in Computer Science, pages 179-195, Springer, 2024.