Progressive Spatio-temporal Perception for Audio-Visual Question Answering

Guangyao Li, Wenxuan Hou, Di Hu 0001. Progressive Spatio-temporal Perception for Audio-Visual Question Answering. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October - 3 November 2023, pages 7808-7816. ACM, 2023.

Authors

Guangyao Li

Wenxuan Hou

Di Hu 0001