A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong Liu. A Multimodal, Multi-Task Adapting Framework for Video Action Recognition. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2024, February 20-27, 2024, Vancouver, Canada. Pages 5517-5525, AAAI Press, 2024.
