The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

Yuda Song 0001, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun 0002. The Importance of Online Data: Understanding Preference Fine-tuning via Coverage. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024. [doi]

Authors

Yuda Song 0001

This author has not been identified. Look up 'Yuda Song 0001' in Google

Gokul Swamy

This author has not been identified. Look up 'Gokul Swamy' in Google

Aarti Singh

This author has not been identified. Look up 'Aarti Singh' in Google

J. Andrew Bagnell

This author has not been identified. Look up 'J. Andrew Bagnell' in Google

Wen Sun 0002

This author has not been identified. Look up 'Wen Sun 0002' in Google