Stabilizing Actor Policies by Approximating Advantage Distributions from K Critics

Alfonso B. Labao, Prospero C. Naval. Stabilizing Actor Policies by Approximating Advantage Distributions from K Critics. In 24th International Conference on Pattern Recognition, ICPR 2018, Beijing, China, August 20-24, 2018, pages 1253-1258. IEEE Computer Society, 2018.
