Movie Caption Generation with Vision Transformer and Transformer-based Language Model

Sorato Nakamura, Hidekazu Yanagimoto, Kiyota Hashimoto. Movie Caption Generation with Vision Transformer and Transformer-based Language Model. In 14th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2023, Koriyama, Japan, July 8-13, 2023. pages 88-93, IEEE, 2023. [doi]

Abstract

Abstract is missing.