TRCaptionNet: A novel and accurate deep Turkish image captioning model with vision transformer based image encoders and deep linguistic text decoders

Serdar Yildiz, Abbas Memis, Songül Varli. TRCaptionNet: A novel and accurate deep Turkish image captioning model with vision transformer based image encoders and deep linguistic text decoders. Turkish J. Electr. Eng. Comput. Sci., 31(6):1079-1098, October 2023. [doi]

Abstract

Abstract is missing.