Learning Fine-Grained Semantics in Spoken Language Using Visual Grounding

Xinsheng Wang, Tian Tian, Jihua Zhu, Odette Scharenborg. Learning Fine-Grained Semantics in Spoken Language Using Visual Grounding. In IEEE International Symposium on Circuits and Systems, ISCAS 2021, Daegu, South Korea, May 22-28, 2021. pages 1-5, IEEE, 2021. [doi]

Abstract

Abstract is missing.