CommitMoE: Efficient Fallback-Free MoE Inference with Offloading Under GPU Memory Constraints

Han Li, Jingwei Sun 0001, Junqing Lin, Guangzhong Sun. CommitMoE: Efficient Fallback-Free MoE Inference with Offloading Under GPU Memory Constraints. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026, pages 22904-22912. AAAI Press, 2026.

Authors

Han Li

Jingwei Sun 0001

Junqing Lin

Guangzhong Sun
