Multi-Speaker Video Dialog with Frame-Level Temporal Localization

Qiang Wang, Pin Jiang, Zhiyi Guo, Yahong Han, Zhou Zhao. Multi-Speaker Video Dialog with Frame-Level Temporal Localization. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020. pages 12200-12207, AAAI Press, 2020. [doi]

Abstract

Abstract is missing.