Double-Attention Transformer for Cross-Modal Image Captioning: Enhancing Visual-Linguistic Alignment on Low-Resource Datasets

Muhammad Aoun, Tehseen Mazhar, Tariq Shahzad, Wajahat Waheed, Habib Hamam. Double-Attention Transformer for Cross-Modal Image Captioning: Enhancing Visual-Linguistic Alignment on Low-Resource Datasets. Applied Comp. Int. Soft Computing, 2026(1), 2026.
