Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers

Yasheng Sun, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Zhibin Hong, Jingtuo Liu, Errui Ding, Jingdong Wang, Ziwei Liu, Hideki Koike. Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers. In Soon Ki Jung, Jehee Lee, Adam W. Bargteil, editors, SIGGRAPH Asia 2022 Conference Papers, SA 2022, Daegu, Republic of Korea, December 6-9, 2022. ACM, 2022.
