Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning

Tianyi Li, GenKe Yang, Jian Chu. Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning. IEEE T. Cybernetics, 54(5):3051-3064, May 2024.

Abstract

Abstract is missing.