PitVQA: Image-Grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery

Runlong He, Mengya Xu, Adrito Das, Danyal Z. Khan, Sophia Bano, Hani J. Marcus, Danail Stoyanov, Matthew J. Clarkson, Mobarakol Islam. PitVQA: Image-Grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery. In Marius George Linguraru, Qi Dou, Aasa Feragen, Stamatia Giannarou, Ben Glocker, Karim Lekadir, Julia A. Schnabel, editors, Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 - 27th International Conference, Marrakesh, Morocco, October 6-10, 2024, Proceedings, Part VI. Volume 15006 of Lecture Notes in Computer Science, pages 488-498, Springer, 2024.

Authors

Runlong He

Mengya Xu

Adrito Das

Danyal Z. Khan

Sophia Bano

Hani J. Marcus

Danail Stoyanov

Matthew J. Clarkson

Mobarakol Islam
