A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks

Leonardo Kanashiro Felizardo, Edoardo Fadda, Paolo Brandimarte, Emilio Del Moral Hernandez, Mariá Cristina Vasconcelos Nascimento. A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks. In International Joint Conference on Neural Networks, IJCNN 2025, Rome, Italy, June 30 - July 5, 2025. pages 1-8, IEEE, 2025.

Abstract

Abstract is missing.