Matching Text and Audio Embeddings: Exploring Transfer-Learning Strategies for Language-Based Audio Retrieval

Benno Weck, Miguel Pérez Fernández, Holger Kirchhoff, Xavier Serra. Matching Text and Audio Embeddings: Exploring Transfer-Learning Strategies for Language-Based Audio Retrieval. In Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gaël Richard, Romain Serizel, Dan Stowell, editors, Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022, Nancy, France, November 3-4, 2022. Tampere University, 2022.

Authors

Benno Weck

Miguel Pérez Fernández

Holger Kirchhoff

Xavier Serra
