Finite Sample Analysis of Average-Reward TD Learning and $Q$-Learning

Sheng Zhang, Zhe Zhang, Siva Theja Maguluri. Finite Sample Analysis of Average-Reward TD Learning and $Q$-Learning. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 1230-1242, 2021. [doi]

Authors

Sheng Zhang

This author has not been identified. Look up 'Sheng Zhang' in Google

Zhe Zhang

This author has not been identified. Look up 'Zhe Zhang' in Google

Siva Theja Maguluri

This author has not been identified. Look up 'Siva Theja Maguluri' in Google