Yangyang Zhao, Zhenyu Wang, Kai Yin, Rui Zhang 0046, Zhenhua Huang, Pei Wang. Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020. pages 9676-9684, AAAI Press, 2020. [doi]
Abstract is missing.