Applying Positional Encoding to Enhance Vision-Language Transformers

Xuehao Liu, Sarah Jane Delany, Susan McKeever. Applying Positional Encoding to Enhance Vision-Language Transformers. In Petia Radeva, Giovanni Maria Farinella, Kadi Bouatouch, editors, Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2023, Volume 5: VISAPP, Lisbon, Portugal, February 19-21, 2023, pages 838-845. SCITEPRESS, 2023.

Authors

Xuehao Liu

Sarah Jane Delany

Susan McKeever
