DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Yongming Rao, Wenliang Zhao, Benlin Liu, Jiwen Lu, Jie Zhou 0001, Cho-Jui Hsieh. DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 13937-13949, 2021. [doi]

Authors

Yongming Rao

This author has not been identified. Look up 'Yongming Rao' in Google

Wenliang Zhao

This author has not been identified. Look up 'Wenliang Zhao' in Google

Benlin Liu

This author has not been identified. Look up 'Benlin Liu' in Google

Jiwen Lu

This author has not been identified. Look up 'Jiwen Lu' in Google

Jie Zhou 0001

This author has not been identified. Look up 'Jie Zhou 0001' in Google

Cho-Jui Hsieh

This author has not been identified. Look up 'Cho-Jui Hsieh' in Google