Video-driven speaker-listener generation based on Transformer and neural renderer

Daowu Yang, Qi Yang, Wen Jiang, Jifeng Chen, Zhengxi Shao, Qiong Liu. Video-driven speaker-listener generation based on Transformer and neural renderer. Multimedia Tools Appl., 83(27):70501-70522, August 2024. [doi]

Abstract

Abstract is missing.