Quantum Policy Iteration via Amplitude Estimation and Grover Search - Towards Quantum Advantage for Reinforcement Learning

Simon Wiedemann, Daniel Hein 0001, Steffen Udluft, Christian B. Mendl. Quantum Policy Iteration via Amplitude Estimation and Grover Search - Towards Quantum Advantage for Reinforcement Learning. Trans. Mach. Learn. Res., 2023, 2023. [doi]

Abstract

Abstract is missing.