The following publications are possibly variants of this publication:
- Aligning Source Visual and Target Language Domains for Unpaired Video Captioning. Fenglin Liu, Xian Wu, Chenyu You, Shen Ge, Yuexian Zou, Xu Sun. pami, 44(12):9255-9268, 2022.
- Video captioning with text-based dynamic attention and step-by-step learning. Huanhou Xiao, Jinglun Shi. prl, 133:305-312, 2020.