CDZL: a controllable diversity zero-shot image caption model using large language models

Xin Zhao, Weiwei Kong, Zongyao Liu, Menghao Wang, Yiwen Li. CDZL: a controllable diversity zero-shot image caption model using large language models. Signal, Image and Video Processing, 19(4):324, April 2025. [doi]

Abstract

Abstract is missing.