Xianyu Chen, Jinhui Yang, Shi Chen, Louis Wang, Ming Jiang, Qi Zhao. Every Problem, Every Step, All in Focus: Learning to Solve Vision-Language Problems With Integrated Attention. IEEE Trans. Pattern Anal. Mach. Intell., 46(7):4720-4735, 2024.