Language-based Audio Retrieval with GPT-Augmented Captions and Self-Attended Audio Clips

Fuyu Gu, Yang Gu, Yiyan Xu, Haoran Sun, Yushan Pan, Shengchen Li, Haiyang Zhang. Language-based Audio Retrieval with GPT-Augmented Captions and Self-Attended Audio Clips. In Weiming Shen 0001, Jean-Paul A. Barthès, Junzhou Luo, Tie Qiu 0001, Xiaobo Zhou 0003, Jinghui Zhang, Haibin Zhu, Kunkun Peng, Tianyi Xu, Ning Chen 0008, editors, 27th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2024, Tianjin, China, May 8-10, 2024. pages 858-863, IEEE, 2024. [doi]

Abstract

Abstract is missing.