Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning

Hirotaka Hachiya, Jan Peters, Masashi Sugiyama. Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning. Neural Computation, 23(11):2798-2832, 2011.

Abstract

Abstract is missing.