Distributed Adaptive Speculative Decoding: Accelerating Large Language Model Inference With Context-Aware Draft Selection - researchr publication

researchr

You are not signed in
Sign in
Sign up

Tejas Pravinbhai Patel, Vinay R. Soni, Amit Kumar Padhy, Siva Rama Krishna Varma Bayyavarapu, Milan Parikh. Distributed Adaptive Speculative Decoding: Accelerating Large Language Model Inference With Context-Aware Draft Selection. In 2026 14th International Symposium on Digital Forensics and Security (ISDFS), Boston, MA, USA, March 19-20, 2026. pages 1-6, IEEE, 2026. [doi]

Abstract is missing.

runs on WebDSL