Deep multimodal semantic embeddings for speech and images

David F. Harwath, James R. Glass. Deep multimodal semantic embeddings for speech and images. In 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015. pages 237-244, IEEE, 2015. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.