Unsupervised Learning of Spoken Language with Visual Context

David F. Harwath, Antonio Torralba, James R. Glass. Unsupervised Learning of Spoken Language with Visual Context. In Daniel D. Lee, Masashi Sugiyama, Ulrike V. Luxburg, Isabelle Guyon, Roman Garnett, editors, Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain. pages 1858-1866, 2016. [doi]

Bibliographies