Can Audio Captions Be Evaluated With Image Caption Metrics?

Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu. Can Audio Captions Be Evaluated With Image Caption Metrics? In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. pages 981-985, IEEE, 2022. doi: 10.1109/ICASSP43922.2022.9746427

@inproceedings{ZhouZXXWZ22,
  title = {Can Audio Captions Be Evaluated With Image Caption Metrics?},
  author = {Zelin Zhou and Zhiling Zhang and Xuenan Xu and Zeyu Xie and Mengyue Wu and Kenny Q. Zhu},
  year = {2022},
  doi = {10.1109/ICASSP43922.2022.9746427},
  url = {https://doi.org/10.1109/ICASSP43922.2022.9746427},
  researchr = {https://researchr.org/publication/ZhouZXXWZ22},
  cites = {0},
  citedby = {0},
  pages = {981--985},
  booktitle = {IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-0540-9},
}