Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER

Markus Holzleitner, Lukas Gruber, José Antonio Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter. Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER. Transactions on Large-Scale Data- and Knowledge-Centered Systems, 48:105-130, 2021.
