Inverse Reinforcement Learning with Agents' Biased Exploration Based on Sub-Optimal Sequential Action Data

Fumito Uwano, Satoshi Hasegawa, Keiki Takadama. Inverse Reinforcement Learning with Agents' Biased Exploration Based on Sub-Optimal Sequential Action Data. JACIII, 28(2):380-392, March 2024. [doi]

Abstract

Abstract is missing.