Joint learning of images and videos with a single Vision Transformer

Shuki Shimizu, Toru Tamaki. Joint learning of images and videos with a single Vision Transformer. In 18th International Conference on Machine Vision and Applications, MVA 2023, Hamamatsu, Japan, July 23-25, 2023. pages 1-6, IEEE, 2023. [doi]

Abstract

Abstract is missing.