Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai 0001, Ramki Gummadi, Oscar A. Ramirez, Christopher K. Harris, A. Rupam Mahmood, Dale Schuurmans. Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024. [doi]

This author has not been identified. Look up 'Fengdi Che' in GoogleThis author has not been identified. Look up 'Chenjun Xiao' in GoogleThis author has not been identified. Look up 'Jincheng Mei' in GoogleThis author has not been identified. Look up 'Bo Dai 0001' in GoogleThis author has not been identified. Look up 'Ramki Gummadi' in GoogleThis author has not been identified. Look up 'Oscar A. Ramirez' in GoogleThis author has not been identified. Look up 'Christopher K. Harris' in GoogleThis author has not been identified. Look up 'A. Rupam Mahmood' in GoogleThis author has not been identified. Look up 'Dale Schuurmans' in Google

runs on WebDSL