BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data

Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu. BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023. pages 2756-2764, ACM, 2023. [doi]

Authors

Xuenan Xu

This author has not been identified. Look up 'Xuenan Xu' in Google

Zhiling Zhang

This author has not been identified. Look up 'Zhiling Zhang' in Google

Zelin Zhou

This author has not been identified. Look up 'Zelin Zhou' in Google

Pingyue Zhang

This author has not been identified. Look up 'Pingyue Zhang' in Google

Zeyu Xie

This author has not been identified. Look up 'Zeyu Xie' in Google

Mengyue Wu

This author has not been identified. Look up 'Mengyue Wu' in Google

Kenny Q. Zhu

This author has not been identified. Look up 'Kenny Q. Zhu' in Google