Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

Guangyi Chen, Xiao Liu, Guangrun Wang, Kun Zhang, Philip H. S. Torr, Xiao-Ping Zhang, Yansong Tang. Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pages 13899-13909. IEEE, 2023.
