PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen. PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation. In 30th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2020, Espoo, Finland, September 21-24, 2020. pages 1-6, IEEE, 2020. [doi]

Abstract

Abstract is missing.