Dung Nguyen Tien, Kien Nguyen, Sridha Sridharan, David Dean, Clinton Fookes. Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition. Computer Vision and Image Understanding, 174:33-42, 2018. [doi]
Abstract is missing.