Deep multimodal semantic embeddings for speech and images

David F. Harwath, James R. Glass. Deep multimodal semantic embeddings for speech and images. In 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pages 237-244. IEEE, 2015.

Authors

David F. Harwath

James R. Glass
