Negating Negatives: Alignment with Human Negative Samples via Distributional Dispreference Optimization

Shitong Duan, Xiaoyuan Yi, Peng Zhang 0060, Yan Liu 0002, Zheng Liu 0011, Tun Lu, Xing Xie 0001, Ning Gu. Negating Negatives: Alignment with Human Negative Samples via Distributional Dispreference Optimization. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024. pages 1012-1042, Association for Computational Linguistics, 2024. [doi]

Abstract

Abstract is missing.