SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training

Yuanze Lin, Chen Wei 0005, Huiyu Wang, Alan L. Yuille, Cihang Xie. SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 2459-2469, IEEE, 2023. [doi]

Authors

Yuanze Lin

This author has not been identified. Look up 'Yuanze Lin' in Google

Chen Wei 0005

This author has not been identified. Look up 'Chen Wei 0005' in Google

Huiyu Wang

This author has not been identified. Look up 'Huiyu Wang' in Google

Alan L. Yuille

This author has not been identified. Look up 'Alan L. Yuille' in Google

Cihang Xie

This author has not been identified. Look up 'Cihang Xie' in Google