ShowUI: One Vision-Language-Action Model for GUI Visual Agent - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Kevin Qinghong Lin, Linjie Li, Difei Gao, Zhengyuan Yang, Shiwei Wu, Zechen Bai, Stan Weixian Lei, Lijuan Wang, Mike Zheng Shou. ShowUI: One Vision-Language-Action Model for GUI Visual Agent. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 19498-19508, Computer Vision Foundation / IEEE, 2025. [doi]

This author has not been identified. Look up 'Kevin Qinghong Lin' in GoogleThis author has not been identified. Look up 'Linjie Li' in GoogleThis author has not been identified. Look up 'Difei Gao' in GoogleThis author has not been identified. Look up 'Zhengyuan Yang' in GoogleThis author has not been identified. Look up 'Shiwei Wu' in GoogleThis author has not been identified. Look up 'Zechen Bai' in GoogleThis author has not been identified. Look up 'Stan Weixian Lei' in GoogleThis author has not been identified. Look up 'Lijuan Wang' in GoogleThis author has not been identified. Look up 'Mike Zheng Shou' in Google

runs on WebDSL