Action Recognition via Fine-Tuned CLIP Model and Temporal Transformer

Xiaoyu Yang, Yuzhuo Fu, Ting Liu. Action Recognition via Fine-Tuned CLIP Model and Temporal Transformer. In Bin Sheng, Lei Bi, Jinman Kim, Nadia Magnenat-Thalmann, Daniel Thalmann, editors, Advances in Computer Graphics - 40th Computer Graphics International Conference, CGI 2023, Shanghai, China, August 28 - September 1, 2023, Proceedings, Part III. Volume 14497 of Lecture Notes in Computer Science, pages 498-513, Springer, 2023.

Abstract

Abstract is missing.