Shigang Li 0002, Jingkun Dong, Jihao Chen, Zhi Ma, Zhongzhe Hu. Hypertron: Efficiently Scaling Large Models by Exploring High-Dimensional Parallelization Space. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2025, St. Louis, MO, USA, November 16-21, 2025. pages 1755-1768, ACM, 2025. [doi]
Abstract is missing.