CroMA: Cross-Modal Attention for Visual Question Answering in Robotic Surgery

Greetta Antonio, Jobin Jose, Sudhish N. George, Kiran B. Raja. CroMA: Cross-Modal Attention for Visual Question Answering in Robotic Surgery. In Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal, editors, Pattern Recognition - 27th International Conference, ICPR 2024, Kolkata, India, December 1-5, 2024, Proceedings, Part XXX. Volume 15330 of Lecture Notes in Computer Science, pages 459-471, Springer, 2024.