Based-CLIP early fusion transformer for image caption

Jinyu Guo, Yuejia Li, Guanghui Cheng, Wenrui Li. Based-CLIP early fusion transformer for image caption. Signal, Image and Video Processing, 19(1):112, January 2025.

Abstract

Abstract is missing.