PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Alekh Agarwal, Mikael Henaff, Sham M. Kakade, Wen Sun. PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

This author has not been identified. Look up 'Alekh Agarwal' in GoogleThis author has not been identified. Look up 'Mikael Henaff' in GoogleThis author has not been identified. Look up 'Sham M. Kakade' in GoogleThis author has not been identified. Look up 'Wen Sun' in Google

runs on WebDSL