AAN+: Generalized Average Attention Network for Accelerating Neural Transformer

Biao Zhang 0002, Deyi Xiong, Yubin Ge, Junfeng Yao, Hao Yue, Jinsong Su. AAN+: Generalized Average Attention Network for Accelerating Neural Transformer. J. Artif. Intell. Res. (JAIR), 75:677-708, 2022. [doi]

Authors

Biao Zhang 0002

This author has not been identified. Look up 'Biao Zhang 0002' in Google

Deyi Xiong

This author has not been identified. Look up 'Deyi Xiong' in Google

Yubin Ge

This author has not been identified. Look up 'Yubin Ge' in Google

Junfeng Yao

This author has not been identified. Look up 'Junfeng Yao' in Google

Hao Yue

This author has not been identified. Look up 'Hao Yue' in Google

Jinsong Su

This author has not been identified. Look up 'Jinsong Su' in Google