Diedre Carmo, Letícia Rittner, Roberto de Alencar Lotufo. VisualT5: Multitasking Caption and Concept Prediction with Pre-trained ViT, T5 and Customized Spatial Attention in Radiological Images. In Guglielmo Faggioli, Nicola Ferro, Petra Galuscáková, Alba García Seco de Herrera, editors, Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), Grenoble, France, 9-12 September, 2024. Volume 3740 of CEUR Workshop Proceedings, pages 1525-1532, CEUR-WS.org, 2024.