Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data

Gokul Karthik Kumar, Rishabh Saraf, Ludovick Lepauloux, Abdul Muneer, Billel Mokeddem, Hakim Hacid. Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data. In IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2025, Honolulu, HI, USA, December 6-10, 2025. pages 1-8, IEEE, 2025. [doi]

@inproceedings{KumarSLMMH25,
  title = {Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data},
  author = {Gokul Karthik Kumar and Rishabh Saraf and Ludovick Lepauloux and Abdul Muneer and Billel Mokeddem and Hakim Hacid},
  year = {2025},
  doi = {10.1109/ASRU65441.2025.11434596},
  url = {https://doi.org/10.1109/ASRU65441.2025.11434596},
  researchr = {https://researchr.org/publication/KumarSLMMH25},
  cites = {0},
  citedby = {0},
  pages = {1-8},
  booktitle = {IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2025, Honolulu, HI, USA, December 6-10, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-4426-3},
}