PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Baijiong Lin, Weisen Jiang, Yuancheng Xu, Hao Chen, Ying-Cong Chen. PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model. In Forty-second International Conference on Machine Learning, ICML 2025, Vancouver, BC, Canada, July 13-19, 2025. OpenReview.net, 2025. [doi]

This author has not been identified. Look up 'Baijiong Lin' in GoogleThis author has not been identified. Look up 'Weisen Jiang' in GoogleThis author has not been identified. Look up 'Yuancheng Xu' in GoogleThis author has not been identified. It may be one of the following persons:

Hao Chen

Look up 'Hao Chen' in GoogleThis author has not been identified. Look up 'Ying-Cong Chen' in Google

runs on WebDSL