MoEAtt: A Deep Mixture of Experts Model using Attention-based Routing Gate

Gal Blecher, Shai Fine. MoEAtt: A Deep Mixture of Experts Model using Attention-based Routing Gate. In International Conference on Machine Learning and Applications, ICMLA 2023, Jacksonville, FL, USA, December 15-17, 2023. pages 1018-1024, IEEE, 2023.
