Hyena Hierarchy: Towards Larger Convolutional Language Models

Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Ré. Hyena Hierarchy: Towards Larger Convolutional Language Models. In Andreas Krause, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 28043-28078, PMLR, 2023.

Authors

Michael Poli

Stefano Massaroli

Eric Nguyen

Daniel Y. Fu

Tri Dao

Stephen Baccus

Yoshua Bengio

Stefano Ermon

Christopher Ré
