A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Andrew Patterson, Adam White 0001, Martha White. A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning. Journal of Machine Learning Research, 23, 2022. [doi]

Authors

Andrew Patterson

This author has not been identified. Look up 'Andrew Patterson' in Google

Adam White 0001

This author has not been identified. Look up 'Adam White 0001' in Google

Martha White

This author has not been identified. Look up 'Martha White' in Google