Connecting What To Say With Where To Look by Modeling Human Attention Traces

Zihang Meng, Licheng Yu, Ning Zhang, Tamara L. Berg, Babak Damavandi, Vikas Singh, Amy Bearman. Connecting What To Say With Where To Look by Modeling Human Attention Traces. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 12679-12688, Computer Vision Foundation / IEEE, 2021. [doi]

Abstract

Abstract is missing.