Provably Robust DPO: Aligning Language Models with Noisy Feedback

Sayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan. Provably Robust DPO: Aligning Language Models with Noisy Feedback. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024.

Authors

Sayak Ray Chowdhury

Anush Kini

Nagarajan Natarajan
