Guohao Sun, Can Qin, Yihao Feng, Zeyuan Chen 0001, Ran Xu 0001, Sohail A. Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao. Structured Policy Optimization: Enhance Large Vision-Language Model via Self-Referenced Dialogue. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025. pages 741-751, IEEE, 2025. [doi]
Abstract is missing.