Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies - researchr publication references

researchr

You are not signed in
Sign in
Sign up

Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Gaurav Jain, Oren Pereg, Moshe Wasserblat, David Harel. Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies. In Forty-second International Conference on Machine Learning, ICML 2025, Vancouver, BC, Canada, July 13-19, 2025. OpenReview.net, 2025. [doi]

No references recorded for this publication.

No citations of this publication recorded.

runs on WebDSL