Video captioning based on vision transformer and reinforcement learning

Hong Zhao, Zhiwen Chen, Lan Guo, Zeyu Han. Video captioning based on vision transformer and reinforcement learning. PeerJ Computer Science, 8, 2022. [doi]

Authors

Hong Zhao

This author has not been identified. Look up 'Hong Zhao' in Google

Zhiwen Chen

This author has not been identified. Look up 'Zhiwen Chen' in Google

Lan Guo

This author has not been identified. Look up 'Lan Guo' in Google

Zeyu Han

This author has not been identified. Look up 'Zeyu Han' in Google