DRESS : Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran. DRESS : Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 14239-14250, IEEE, 2024. [doi]

Abstract

Abstract is missing.