SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation

Sonal Kumar, Prem Seetharaman, Justin Salamon, Dinesh Manocha, Oriol Nieto. SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025, Tahoe City, CA, USA, October 12-15, 2025. pages 1-5, IEEE, 2025. [doi]

@inproceedings{KumarSSMN25,
  title = {SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation},
  author = {Sonal Kumar and Prem Seetharaman and Justin Salamon and Dinesh Manocha and Oriol Nieto},
  year = {2025},
  doi = {10.1109/WASPAA66052.2025.11230964},
  url = {https://doi.org/10.1109/WASPAA66052.2025.11230964},
  researchr = {https://researchr.org/publication/KumarSSMN25},
  cites = {0},
  citedby = {0},
  pages = {1-5},
  booktitle = {IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025, Tahoe City, CA, USA, October 12-15, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-3745-6},
}