Juechu Dong, Boyuan Feng, Driss Guessous, Yanbo Liang, Horace He. FlexAttention: A Programming Model for Generating Fused Attention Variants. In Matei Zaharia, Gauri Joshi, Yingyan (Celine) Lin, editors, Proceedings of the Eighth Conference on Machine Learning and Systems, MLSys 2025, Santa Clara, CA, USA, May 12-15, 2025. OpenReview.net/mlsys.org, 2025. [doi]
Abstract is missing.