Sparse MoE Students for Efficient Knowledge Distillation

Jongwon Ryu, Mingi Kim, Junyeong Kim. Sparse MoE Students for Efficient Knowledge Distillation. IEEE Access, 13:187373-187382, 2025. [doi]

Abstract

Abstract is missing.