TD_gamma: Re-evaluating Complex Backups in Temporal Difference Learning

George Konidaris, Scott Niekum, Philip S. Thomas. TD_gamma: Re-evaluating Complex Backups in Temporal Difference Learning. In John Shawe-Taylor, Richard S. Zemel, Peter L. Bartlett, Fernando C. N. Pereira, Kilian Q. Weinberger, editors, Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, Granada, Spain. pages 2402-2410, 2011. [doi]

Abstract

Abstract is missing.