Zero-shot diverse audio captioning with diffusion models

Yonggang Zhu, Yiming Zhang 0025, Li Xiao 0005, Wenwu Wang 0001, Aidong Men. Zero-shot diverse audio captioning with diffusion models. Knowl.-Based Syst., 335:115205, 2026. [doi]

Abstract

Abstract is missing.