Online Reinforcement Learning with Uncertain Episode Lengths

Debmalya Mandal, Goran Radanovic, Jiarui Gan, Adish Singla, Rupak Majumdar. Online Reinforcement Learning with Uncertain Episode Lengths. In Brian Williams 0001, Yiling Chen 0001, Jennifer Neville, editors, Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023. pages 9064-9071, AAAI Press, 2023. [doi]

Authors

Debmalya Mandal

This author has not been identified. Look up 'Debmalya Mandal' in Google

Goran Radanovic

This author has not been identified. Look up 'Goran Radanovic' in Google

Jiarui Gan

This author has not been identified. Look up 'Jiarui Gan' in Google

Adish Singla

This author has not been identified. Look up 'Adish Singla' in Google

Rupak Majumdar

This author has not been identified. Look up 'Rupak Majumdar' in Google