Distributed Adaptive Speculative Decoding: Accelerating Large Language Model Inference With Context-Aware Draft Selection

Tejas Pravinbhai Patel, Vinay R. Soni, Amit Kumar Padhy, Siva Rama Krishna Varma Bayyavarapu, Milan Parikh. Distributed Adaptive Speculative Decoding: Accelerating Large Language Model Inference With Context-Aware Draft Selection. In 2026 14th International Symposium on Digital Forensics and Security (ISDFS), Boston, MA, USA, March 19-20, 2026. pages 1-6, IEEE, 2026. [doi]

Abstract

Abstract is missing.