Preference Ranking Optimization for Human Alignment

Feifan Song 0001, Bowen Yu 0002, Minghao Li, Haiyang Yu, Fei Huang, Yongbin Li, Houfeng Wang. Preference Ranking Optimization for Human Alignment. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2024, February 20-27, 2024, Vancouver, Canada. Pages 18990-18998, AAAI Press, 2024.

Authors

Feifan Song 0001


Bowen Yu 0002


Minghao Li


Haiyang Yu


Fei Huang


Yongbin Li


Houfeng Wang
