Zhichun Li, Jun Zhou, Xueqi Li 0001, Ninghui Sun. BlockPIM: Optimizing Memory Management for PIM-enabled Long-Context LLM Inference. In 62nd ACM/IEEE Design Automation Conference, DAC 2025, San Francisco, CA, USA, June 22-25, 2025. pages 1-7, IEEE, 2025. [doi]
Abstract is missing.