Online learning in MDPs with linear function approximation and bandit feedback

Gergely Neu, Julia Olkhovskaya. Online learning in MDPs with linear function approximation and bandit feedback. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 10407-10417, 2021.
