DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs

Aayam Kumar Shrestha, Stefan Lee, Prasad Tadepalli, Alan Fern. DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021.

@inproceedings{ShresthaLTF21,
  title = {DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs},
  author = {Aayam Kumar Shrestha and Stefan Lee and Prasad Tadepalli and Alan Fern},
  year = {2021},
  url = {https://openreview.net/forum?id=eMP1j9efXtX},
  researchr = {https://researchr.org/publication/ShresthaLTF21},
  cites = {0},
  citedby = {0},
  booktitle = {9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021},
  publisher = {OpenReview.net},
}