End-to-End Referring Video Object Segmentation with Multimodal Transformers

Adam Botach, Evgenii Zheltonozhskii, Chaim Baskin. End-to-End Referring Video Object Segmentation with Multimodal Transformers. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 4975-4985, IEEE, 2022.
