Adaptive Attention for Sparse-based Long-sequence Transformer

Xuanyu Zhang, Zhepeng Lv, Qing Yang. Adaptive Attention for Sparse-based Long-sequence Transformer. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023. Pages 8602-8610, Association for Computational Linguistics, 2023.

Authors

Xuanyu Zhang

Zhepeng Lv

Qing Yang
