Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark

Stefan O'Toole, Nir Lipovetzky, Miquel Ramírez, Adrian R. Pearce. Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 26536-26547, 2021. [doi]

Authors

Stefan O'Toole

This author has not been identified. Look up 'Stefan O'Toole' in Google

Nir Lipovetzky

This author has not been identified. Look up 'Nir Lipovetzky' in Google

Miquel Ramírez

This author has not been identified. Look up 'Miquel Ramírez' in Google

Adrian R. Pearce

This author has not been identified. Look up 'Adrian R. Pearce' in Google