Batch Active Learning of Reward Functions from Human Preferences

Erdem Biyik, Nima Anari, Dorsa Sadigh. Batch Active Learning of Reward Functions from Human Preferences. ACM Trans. Hum. Robot Interact., 13(2), June 2024.

Abstract

Abstract is missing.