A Heuristic Q-Learning Architecture for Fully Exploring a World and Deriving an Optimal Policy by Model-Based Planning - researchr publication

researchr

You are not signed in
Sign in
Sign up

Gang Zhao, Shoji Tatsumi, Ruoying Sun. A Heuristic Q-Learning Architecture for Fully Exploring a World and Deriving an Optimal Policy by Model-Based Planning. In ICRA. pages 2078-2083, 1999.

Abstract is missing.

runs on WebDSL