Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang 0108. Break the Sequential Dependency of LLM Inference Using Lookahead Decoding. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.