David Harwath, Wei-Ning Hsu, James R. Glass. Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020. [doi]
Abstract is missing.