A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Andrew Patterson, Adam White 0001, Martha White. A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning. Journal of Machine Learning Research, 23, 2022. [doi]

@article{Patterson0W22,
  title = {A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning},
  author = {Andrew Patterson and Adam White 0001 and Martha White},
  year = {2022},
  url = {http://jmlr.org/papers/v23/21-037.html},
  researchr = {https://researchr.org/publication/Patterson0W22},
  cites = {0},
  citedby = {0},
  journal = {Journal of Machine Learning Research},
  volume = {23},
}