Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective - researchr publication

researchr

You are not signed in
Sign in
Sign up

Changyou Chen, Jianyi Zhang, Yi Xu, Liqun Chen, Jiali Duan, Yiran Chen 0001, Son Tran, Belinda Zeng, Trishul Chilimbi. Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022. [doi]

Abstract is missing.

runs on WebDSL