Faster, Stronger, and More Interpretable: Massive Transformer Architectures for Vision-Language Tasks

Tong Chen, Sicong Liu, Zhiran Chen, Wenyan Hu, Dachi Chen, Yuanxin Wang, Qi Lyu, Cindy X. Le, Wenping Wang. Faster, Stronger, and More Interpretable: Massive Transformer Architectures for Vision-Language Tasks. Adv. Artif. Intell. Mach. Learn., 3(3):1369-1388, 2023. [doi]

Abstract

Abstract is missing.