Matching Text and Audio Embeddings: Exploring Transfer-Learning Strategies for Language-Based Audio Retrieval

Benno Weck, Miguel Pérez Fernández, Holger Kirchhoff, Xavier Serra. Matching Text and Audio Embeddings: Exploring Transfer-Learning Strategies for Language-Based Audio Retrieval. In Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gaël Richard, Romain Serizel, Dan Stowell, editors, Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022, Nancy, France, November 3-4, 2022. Tampere University, 2022. [doi]

@inproceedings{WeckFKS22,
  title = {Matching Text and Audio Embeddings: Exploring Transfer-Learning Strategies for Language-Based Audio Retrieval},
  author = {Benno Weck and Miguel Pérez Fernández and Holger Kirchhoff and Xavier Serra},
  year = {2022},
  url = {https://dcase.community/documents/workshop2022/proceedings/DCASE2022Workshop_Weck_72.pdf},
  researchr = {https://researchr.org/publication/WeckFKS22},
  cites = {0},
  citedby = {0},
  booktitle = {Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022, Nancy, France, November 3-4, 2022},
  editor = {Mathieu Lagrange and Annamaria Mesaros and Thomas Pellegrini and Gaël Richard and Romain Serizel and Dan Stowell},
  publisher = {Tampere University},
  isbn = {978-952-03-2677-7},
}