Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks

Mohammed Nowaz Rabbani Chowdhury, Shuai Zhang 0015, Meng Wang 0003, Sijia Liu 0001, Pin-Yu Chen. Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 6074-6114, PMLR, 2023. [doi]

Abstract

Abstract is missing.