Be Specific, Be Clear: Bridging Machine and Human Captions by Scene-Guided Transformer

Yupan Huang, Zhaoyang Zeng, Yutong Lu. Be Specific, Be Clear: Bridging Machine and Human Captions by Scene-Guided Transformer. In Bei Liu, Jianlong Fu, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Yong Rui, editors, MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021, pages 4-13. ACM, 2021.
