CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

Tianqi Xu, Linyao Chen, Dai-Jie Wu, Yanjun Chen, Zecheng Zhang, Xiang Yao, Zhiqiang Xie, Yongchao Chen, Shilong Liu, Bochen Qian, Anjie Yang, Zhaoxuan Jin, Jianbo Deng, Philip Torr, Bernard Ghanem, Guohao Li. CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar, editors, Findings of the Association for Computational Linguistics, ACL 2025, Vienna, Austria, July 27 - August 1, 2025, pages 21607-21647. Association for Computational Linguistics, 2025.

Authors

Tianqi Xu

Linyao Chen

Dai-Jie Wu

Yanjun Chen

Zecheng Zhang

Xiang Yao

Zhiqiang Xie

Yongchao Chen

Shilong Liu

Bochen Qian

Anjie Yang

Zhaoxuan Jin

Jianbo Deng

Philip Torr

Bernard Ghanem

Guohao Li
