David F. Harwath, James R. Glass. Deep multimodal semantic embeddings for speech and images. In 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pages 237-244. IEEE, 2015.