Policy-based optimization: single-step policy gradient method seen as an evolution strategy

Jonathan Viquerat, R. Duvigneau, P. Meliga, Alexander Kuhnle, Elie Hachem. Policy-based optimization: single-step policy gradient method seen as an evolution strategy. Neural Computing and Applications, 35(1):449-467, 2023. [doi]

Abstract

Abstract is missing.