LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models

Mojan Javaheripi, Gustavo de Rosa, Subhabrata Mukherjee, Shital Shah, Tomasz Religa, Caio Cesar Teodoro Mendes, Sébastien Bubeck, Farinaz Koushanfar, Debadeepta Dey. LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022.

Authors

Mojan Javaheripi

Gustavo de Rosa

Subhabrata Mukherjee

Shital Shah

Tomasz Religa

Caio Cesar Teodoro Mendes

Sébastien Bubeck

Farinaz Koushanfar

Debadeepta Dey
