Preference Ranking Optimization for Human Alignment

Feifan Song 0001, Bowen Yu 0002, Minghao Li, Haiyang Yu, Fei Huang, Yongbin Li, Houfeng Wang. Preference Ranking Optimization for Human Alignment. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eigth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada. pages 18990-18998, AAAI Press, 2024. [doi]

@inproceedings{00010LYHLW24,
  title = {Preference Ranking Optimization for Human Alignment},
  author = {Feifan Song 0001 and Bowen Yu 0002 and Minghao Li and Haiyang Yu and Fei Huang and Yongbin Li and Houfeng Wang},
  year = {2024},
  doi = {10.1609/aaai.v38i17.29865},
  url = {https://doi.org/10.1609/aaai.v38i17.29865},
  researchr = {https://researchr.org/publication/00010LYHLW24},
  cites = {0},
  citedby = {0},
  pages = {18990-18998},
  booktitle = {Thirty-Eigth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada},
  editor = {Michael J. Wooldridge and Jennifer G. Dy and Sriraam Natarajan},
  publisher = {AAAI Press},
}