How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization

Hai Zhang, Hang Yu, Junqiao Zhao, Di Zhang, Xiao Zhang, Hongtu Zhou, Chang Huang, Chen Ye 0002. How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Hai Zhang

This author has not been identified. Look up 'Hai Zhang' in Google

Hang Yu

This author has not been identified. Look up 'Hang Yu' in Google

Junqiao Zhao

This author has not been identified. Look up 'Junqiao Zhao' in Google

Di Zhang

This author has not been identified. Look up 'Di Zhang' in Google

Xiao Zhang

This author has not been identified. Look up 'Xiao Zhang' in Google

Hongtu Zhou

This author has not been identified. Look up 'Hongtu Zhou' in Google

Chang Huang

This author has not been identified. Look up 'Chang Huang' in Google

Chen Ye 0002

This author has not been identified. Look up 'Chen Ye 0002' in Google