LipFormer: Learning to Lipread Unseen Speakers Based on Visual-Landmark Transformers

Feng Xue, Yu Li, Deyin Liu, Yincen Xie, Lin Wu, Richang Hong. LipFormer: Learning to Lipread Unseen Speakers Based on Visual-Landmark Transformers. IEEE Trans. Circuits Syst. Video Techn., 33(9):4507-4517, September 2023.
