Learning Fair Policies in Multi-objective Preference-Based Reinforcement Learning

Umer Siddique, Abhinav Sinha, Yongcan Cao. Learning Fair Policies in Multi-objective Preference-Based Reinforcement Learning. Machine Learning, 115(1):23, January 2026. [doi]

Authors

Umer Siddique

This author has not been identified. Look up 'Umer Siddique' in Google

Abhinav Sinha

This author has not been identified. Look up 'Abhinav Sinha' in Google

Yongcan Cao

This author has not been identified. Look up 'Yongcan Cao' in Google