SamCap: Energy-based Controllable Image Captioning by Gradient-Based Sampling

Yuchen Niu, Min Zhu, Zhihua Wei. SamCap: Energy-based Controllable Image Captioning by Gradient-Based Sampling. In Cathal Gurrin, Rachada Kongkachandra, Klaus Schoeffmann, Duc-Tien Dang-Nguyen, Luca Rossetto, Shin'ichi Satoh, Liting Zhou, editors, Proceedings of the 2024 International Conference on Multimedia Retrieval, ICMR 2024, Phuket, Thailand, June 10-14, 2024. Pages 608-617, ACM, 2024.

Abstract

Abstract is missing.