Learning to Build Multimodal Intelligence across Vision, Language and Speech

Shuang Ma. Learning to Build Multimodal Intelligence across Vision, Language and Speech. PhD thesis, University at Buffalo, New York, USA, 2019.

Abstract

Abstract is missing.