Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms

Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi, Kunio Kashino, David Harwath, James R. Glass. Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020, pages 4352-4356. IEEE, 2020.
