Block Transformer: Global-to-Local Language Modeling for Fast Inference

Namgyu Ho, Sangmin Bae, Taehyeon Kim 0001, Hyunjik Jo, Yireun Kim, Tal Schuster, Adam Fisch, James Thorne, Se-Young Yun. Block Transformer: Global-to-Local Language Modeling for Fast Inference. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024. [doi]

Authors

Namgyu Ho

This author has not been identified. Look up 'Namgyu Ho' in Google

Sangmin Bae

This author has not been identified. Look up 'Sangmin Bae' in Google

Taehyeon Kim 0001

This author has not been identified. Look up 'Taehyeon Kim 0001' in Google

Hyunjik Jo

This author has not been identified. Look up 'Hyunjik Jo' in Google

Yireun Kim

This author has not been identified. Look up 'Yireun Kim' in Google

Tal Schuster

This author has not been identified. Look up 'Tal Schuster' in Google

Adam Fisch

This author has not been identified. Look up 'Adam Fisch' in Google

James Thorne

This author has not been identified. Look up 'James Thorne' in Google

Se-Young Yun

This author has not been identified. Look up 'Se-Young Yun' in Google