SPO: Self Preference Optimization with Self Regularization

Yuhao Sun, Yifan Zhang, Quandong Wang, Qinzhuo Wu, Wei Liu, Jian Luan 0001. SPO: Self Preference Optimization with Self Regularization. In Christos Christodoulopoulos 0001, Tanmoy Chakraborty 0002, Carolyn Rose, Violet Peng, editors, Findings of the Association for Computational Linguistics: EMNLP 2025, Suzhou, China, November 4-9, 2025. pages 5601-5614, Association for Computational Linguistics, 2025. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.