Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER

Markus Holzleitner, Lukas Gruber, José Antonio Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter. Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER. Transactions on Large-Scale Data- and Knowledge-Centered Systems, 48:105-130, 2021.
