Jonathan Viquerat, R. Duvigneau, P. Meliga, Alexander Kuhnle, Elie Hachem. Policy-based optimization: single-step policy gradient method seen as an evolution strategy. Neural Computing and Applications, 35(1):449-467, 2023. [doi]
Abstract is missing.