PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen. PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation. In 30th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2020, Espoo, Finland, September 21-24, 2020, pages 1-6. IEEE, 2020.

Authors

Perttu Hämäläinen

Amin Babadi

Xiaoxiao Ma

Jaakko Lehtinen
