Learning spatiotemporal lip dynamics in 3D point cloud stream for visual voice activity detection

Jie Zhang, Jingyi Cao, Junhua Sun. Learning spatiotemporal lip dynamics in 3D point cloud stream for visual voice activity detection. Biomed. Signal Proc. and Control, 87(Part B):105410, January 2024.
