Liqing Chen, Yifan Zhuo, Yingjie Wu, Yilei Wang, Xianghan Zheng. Multi-modal Feature Fusion Based on Variational Autoencoder for Visual Question Answering. In Zhouchen Lin, Liang Wang 0001, Jian Yang 0003, Guangming Shi, Tieniu Tan, Nanning Zheng, Xilin Chen, Yanning Zhang, editors, Pattern Recognition and Computer Vision - Second Chinese Conference, PRCV 2019, Xi'an, China, November 8-11, 2019, Proceedings, Part II. Volume 11858 of Lecture Notes in Computer Science, pages 657-669, Springer, 2019. [doi]
Abstract is missing.