Direct Policy Search Reinforcement Learning Based on Variational Bayesian Inference

Nobuhiko Yamaguchi. Direct Policy Search Reinforcement Learning Based on Variational Bayesian Inference. JACIII, 24(6):711-718, 2020. [doi]

Abstract

Abstract is missing.