DAWN: Efficient Distribution of Attention Workload in PIM-Enabled Systems for LLM Inference

Jaehoon Chung, Jinho Han, Young-Ho Gong, Sung Woo Chung. DAWN: Efficient Distribution of Attention Workload in PIM-Enabled Systems for LLM Inference. Computer Architecture Letters, 25(1):65-68, January - June 2026. [doi]

Abstract

Abstract is missing.