How to Evaluate Reward Models for RLHF

Evan Frick, Tianle Li, Connor Chen, Wei-Lin Chiang, Anastasios Nikolas Angelopoulos, Jiantao Jiao, Banghua Zhu, Joseph E. Gonzalez, Ion Stoica. How to Evaluate Reward Models for RLHF. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025.
