SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Yantao Li 0003, Jianbing Zhang, Zhiyong Wu 0003. SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents. In Lun-Wei Ku, Andre Martins, Vivek Srikumar, editors, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024. pages 9313-9332, Association for Computational Linguistics, 2024. [doi]

Authors

Kanzhi Cheng

This author has not been identified. Look up 'Kanzhi Cheng' in Google

Qiushi Sun

This author has not been identified. Look up 'Qiushi Sun' in Google

Yougang Chu

This author has not been identified. Look up 'Yougang Chu' in Google

Fangzhi Xu

This author has not been identified. Look up 'Fangzhi Xu' in Google

Yantao Li 0003

This author has not been identified. Look up 'Yantao Li 0003' in Google

Jianbing Zhang

This author has not been identified. Look up 'Jianbing Zhang' in Google

Zhiyong Wu 0003

This author has not been identified. Look up 'Zhiyong Wu 0003' in Google