MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering

Seokwon Song, Minsu Park, Gunhee Kim. MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. pages 33028-33037, AAAI Press, 2026. [doi]

Authors

Seokwon Song

This author has not been identified. Look up 'Seokwon Song' in Google

Minsu Park

This author has not been identified. Look up 'Minsu Park' in Google

Gunhee Kim

This author has not been identified. Look up 'Gunhee Kim' in Google