Enhancing LLM to Decompile Optimized PTX to Readable CUDA for Tensor Programs

Xinyu Sun, Fugen Tang, Yu Zhang, Han Shen, Chengru Song, Di Zhang. Enhancing LLM to Decompile Optimized PTX to Readable CUDA for Tensor Programs. In 40th IEEE/ACM International Conference on Automated Software Engineering, ASE 2025, Seoul, Republic of Korea, November 16-20, 2025. Pages 2235-2247, IEEE, 2025.
