Compound Tokens: Channel Fusion for Vision-Language Representation Learning

Maxwell Mbabilla Aladago, A. J. Piergiovanni. Compound Tokens: Channel Fusion for Vision-Language Representation Learning. In Krystal Maughan, Rosanne Liu, Thomas F. Burns, editors, The First Tiny Papers Track at ICLR 2023, Tiny Papers @ ICLR 2023, Kigali, Rwanda, May 5, 2023. OpenReview.net, 2023. [doi]

Authors

Maxwell Mbabilla Aladago

This author has not been identified. Look up 'Maxwell Mbabilla Aladago' in Google

A. J. Piergiovanni

This author has not been identified. Look up 'A. J. Piergiovanni' in Google