Lisong Ou, Zhixin Li 0001. Modeling Multi-Task Joint Training of Aggregate Networks for Multi-Modal Sarcasm Detection. In Cathal Gurrin, Rachada Kongkachandra, Klaus Schoeffmann, Duc-Tien Dang-Nguyen, Luca Rossetto, Shin'ichi Satoh 0001, Liting Zhou, editors, Proceedings of the 2024 International Conference on Multimedia Retrieval, ICMR 2024, Phuket, Thailand, June 10-14, 2024. pages 833-841, ACM, 2024. [doi]