TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training

Yulong Liu, Guibo Zhu, Bin Zhu, Qi Song, Guojing Ge, Haoran Chen, Guanhui Qiao, Ru Peng, Lingxiang Wu, Jinqiao Wang. TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022. [doi]

Authors

Yulong Liu

This author has not been identified. Look up 'Yulong Liu' in Google

Guibo Zhu

This author has not been identified. Look up 'Guibo Zhu' in Google

Bin Zhu

This author has not been identified. Look up 'Bin Zhu' in Google

Qi Song

This author has not been identified. Look up 'Qi Song' in Google

Guojing Ge

This author has not been identified. Look up 'Guojing Ge' in Google

Haoran Chen

This author has not been identified. Look up 'Haoran Chen' in Google

Guanhui Qiao

This author has not been identified. Look up 'Guanhui Qiao' in Google

Ru Peng

This author has not been identified. Look up 'Ru Peng' in Google

Lingxiang Wu

This author has not been identified. Look up 'Lingxiang Wu' in Google

Jinqiao Wang

This author has not been identified. Look up 'Jinqiao Wang' in Google