3D-MoE: Accelerating Multi-Expert Activated LLMs on 3D In/Near-Memory Computing Architecture via Hybrid Parallelism

Xinyu Qu, Zehua Zhang, Runnan Xu, Yufei Ma 0002. 3D-MoE: Accelerating Multi-Expert Activated LLMs on 3D In/Near-Memory Computing Architecture via Hybrid Parallelism. In IEEE/ACM International Conference On Computer Aided Design, ICCAD 2025, Munich, Germany, October 26-30, 2025. pages 1-9, IEEE, 2025. [doi]

Authors

Xinyu Qu

This author has not been identified. Look up 'Xinyu Qu' in Google

Zehua Zhang

This author has not been identified. Look up 'Zehua Zhang' in Google

Runnan Xu

This author has not been identified. Look up 'Runnan Xu' in Google

Yufei Ma 0002

This author has not been identified. Look up 'Yufei Ma 0002' in Google