PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

Fei Deng, Qifei Wang, Wei Wei, Tingbo Hou, Matthias Grundmann. PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 7423-7433, IEEE, 2024. [doi]

Bibliographies