Feifei Xu, Wenjing Zhu, Dongyang Li, Puzhe Li. Question-Aware Spatial-Temporal Reasoning in Patch for Audio-Visual Question Answering. In Jakub Lokoc, Ladislav Peska, Jan Zahálka, Stevan Rudinac, Marc A. Kastner, Jingjing Chen, Min-Chun Hu, Jiaxin Wu, Ujjwal Sharma, editors, MultiMedia Modeling - 32nd International Conference on Multimedia Modeling, MMM 2026, Prague, Czech Republic, January 29-31, 2026, Proceedings, Part II. Volume 16413 of Lecture Notes in Computer Science, pages 631-645, Springer, 2026.