Say in Human-Like Way: Hierarchical Cross-modal Information Abstraction and Summarization for Controllable Captioning

Xiaoyi Wang, Jun Huang. Say in Human-Like Way: Hierarchical Cross-modal Information Abstraction and Summarization for Controllable Captioning. In Igor Farkas, Paolo Masulli, Sebastian Otte, Stefan Wermter, editors, Artificial Neural Networks and Machine Learning - ICANN 2021 - 30th International Conference on Artificial Neural Networks, Bratislava, Slovakia, September 14-17, 2021, Proceedings, Part I. Volume 12891 of Lecture Notes in Computer Science, pages 217-228, Springer, 2021. [doi]

Abstract

Abstract is missing.