Grzegorz Chrupala. Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques. J. Artif. Intell. Res. (JAIR), 73:673-707, 2022. [doi]
No references recorded for this publication.
No citations of this publication recorded.