Learning Fine-Grained Semantics in Spoken Language Using Visual Grounding

Xinsheng Wang, Tian Tian, Jihua Zhu, Odette Scharenborg. Learning Fine-Grained Semantics in Spoken Language Using Visual Grounding. In IEEE International Symposium on Circuits and Systems, ISCAS 2021, Daegu, South Korea, May 22-28, 2021. pages 1-5, IEEE, 2021. [doi]

Authors

Xinsheng Wang

This author has not been identified. Look up 'Xinsheng Wang' in Google

Tian Tian

This author has not been identified. Look up 'Tian Tian' in Google

Jihua Zhu

This author has not been identified. Look up 'Jihua Zhu' in Google

Odette Scharenborg

This author has not been identified. Look up 'Odette Scharenborg' in Google