Video captioning based on vision transformer and reinforcement learning

Hong Zhao, Zhiwen Chen, Lan Guo, Zeyu Han. Video captioning based on vision transformer and reinforcement learning. PeerJ Computer Science, 8, 2022.

Abstract

Abstract is missing.