Align before Fuse: Vision and Language Representation Learning with Momentum Distillation - researchr publication

researchr

You are not signed in
Sign in
Sign up

Junnan Li 0001, Ramprasaath R. Selvaraju, Akhilesh Gotmare, Shafiq R. Joty, Caiming Xiong, Steven Chu Hong Hoi. Align before Fuse: Vision and Language Representation Learning with Momentum Distillation. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 9694-9705, 2021. [doi]

Abstract is missing.

runs on WebDSL