Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques

Grzegorz Chrupala. Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques. J. Artif. Intell. Res. (JAIR), 73:673-707, 2022. [doi]

Authors

Grzegorz Chrupala

This author has not been identified. Look up 'Grzegorz Chrupala' in Google