A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning

Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan. A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning. In Jérôme Lang, editor, Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden. pages 2984-2990, ijcai.org, 2018. [doi]

Authors

Long Yang

This author has not been identified. Look up 'Long Yang' in Google

Minhao Shi

This author has not been identified. Look up 'Minhao Shi' in Google

Qian Zheng

This author has not been identified. Look up 'Qian Zheng' in Google

Wenjia Meng

This author has not been identified. Look up 'Wenjia Meng' in Google

Gang Pan

This author has not been identified. Look up 'Gang Pan' in Google