RLVF: Learning from Verbal Feedback without Overgeneralization

Moritz Stephan, Alexander Khazatsky, Eric Mitchell, Annie S. Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn. RLVF: Learning from Verbal Feedback without Overgeneralization. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024.

Authors

Moritz Stephan

Alexander Khazatsky

Eric Mitchell

Annie S. Chen

Sheryl Hsu

Archit Sharma

Chelsea Finn
