Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos

Zishun Feng, Ming Tu, Rui Xia, Yuxuan Wang, Ashok Krishnamurthy. Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos. In Xintao Wu, Chris Jermaine, Li Xiong 0001, Xiaohua Hu 0001, Olivera Kotevska, Siyuan Lu, Weija Xu, Srinivas Aluru, ChengXiang Zhai, Eyhab Al-Masri, Zhiyuan Chen 0003, Jeff Saltz 0001, editors, IEEE International Conference on Big Data, Big Data 2020, Atlanta, GA, USA, December 10-13, 2020. pages 5671-5672, IEEE, 2020. [doi]

Authors

Zishun Feng

This author has not been identified. Look up 'Zishun Feng' in Google

Ming Tu

This author has not been identified. Look up 'Ming Tu' in Google

Rui Xia

This author has not been identified. Look up 'Rui Xia' in Google

Yuxuan Wang

This author has not been identified. Look up 'Yuxuan Wang' in Google

Ashok Krishnamurthy

This author has not been identified. Look up 'Ashok Krishnamurthy' in Google