LookupFFN: Making Transformers Compute-lite for CPU inference

Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh. LookupFFN: Making Transformers Compute-lite for CPU inference. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 40707-40718, PMLR, 2023. [doi]

Authors

Zhanpeng Zeng

This author has not been identified. Look up 'Zhanpeng Zeng' in Google

Michael Davies

This author has not been identified. Look up 'Michael Davies' in Google

Pranav Pulijala

This author has not been identified. Look up 'Pranav Pulijala' in Google

Karthikeyan Sankaralingam

This author has not been identified. Look up 'Karthikeyan Sankaralingam' in Google

Vikas Singh

This author has not been identified. Look up 'Vikas Singh' in Google