DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs

Aayam Kumar Shrestha, Stefan Lee, Prasad Tadepalli, Alan Fern. DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021.

Authors

Aayam Kumar Shrestha

Stefan Lee

Prasad Tadepalli

Alan Fern
