Learning Joint Representations of Videos and Sentences with Web Image Search

Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Naokazu Yokoya. Learning Joint Representations of Videos and Sentences with Web Image Search. In Gang Hua, Hervé Jégou, editors, Computer Vision - ECCV 2016 Workshops - Amsterdam, The Netherlands, October 8-10 and 15-16, 2016, Proceedings, Part I. Volume 9913 of Lecture Notes in Computer Science, pages 651-667, Springer, 2016.

Authors

Mayu Otani

Yuta Nakashima

Esa Rahtu

Janne Heikkilä

Naokazu Yokoya
