Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Ruichen Shao, Bei Li, Gangao Liu, Yang Chen, Zhouxiang, Jingang Wang, Xunliang Cai, Peng Li. Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025.

Authors

Ruichen Shao

Bei Li

Gangao Liu

Yang Chen

Zhouxiang

Jingang Wang

Xunliang Cai

Peng Li
