A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Andrew Patterson, Adam White 0001, Martha White. A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning. Journal of Machine Learning Research, 23, 2022. [doi]

This author has not been identified. Look up 'Andrew Patterson' in GoogleThis author has not been identified. Look up 'Adam White 0001' in GoogleThis author has not been identified. Look up 'Martha White' in Google

runs on WebDSL