Actively learning costly reward functions for reinforcement learning

André Eberhard, Houssam Metni, Georg Fahland, Alexander Stroh, Pascal Friederich. Actively learning costly reward functions for reinforcement learning. Mach. Learn. Sci. Technol., 5(2):15055, 2024. [doi]

Abstract

Abstract is missing.