Xinsheng Tang, Yangcheng Li, Nan Wang, Zhiyi Shu, Xingyu Ling, Junna Xing, Peng Zhou, Qiang Liu. RedFuser: An Automatic Operator Fusion Framework for Cascaded Reductions on AI Accelerators. In Benjamin C. Lee, Harry Xu 0001, Mark Silberstein, Bingyao Li, editors, Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, ASPLOS 2026, Pittsburgh, PA, USA, March 22-26, 2026. pages 1566-1588, ACM, 2026. [doi]
Abstract is missing.