Iterative Foundation Model Fine-Tuning on Multiple Rewards

Pouya M. Ghari, Simone Sciabola, Ye Wang 0024. Iterative Foundation Model Fine-Tuning on Multiple Rewards. In Danielle Belgrave, Cheng Zhang 0005, Laura N. Montoya, Hsuan-Tien Lin, Razvan Pascanu, Piotr Koniusz, Marzyeh Ghassemi, Nancy Chen, Iván Vladimir Meza Ruíz, Arturo Loaiza-Bonilla, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, NeurIPS 2025, San Diago, CA, USA, December 2-7, 2025 / Mexico City, Mexico, November 30 - December 5, 2025. 2025. [doi]

Authors

Pouya M. Ghari

This author has not been identified. Look up 'Pouya M. Ghari' in Google

Simone Sciabola

This author has not been identified. Look up 'Simone Sciabola' in Google

Ye Wang 0024

This author has not been identified. Look up 'Ye Wang 0024' in Google