Learning and value function approximation in complex decision processes

Benjamin Van Roy. Learning and value function approximation in complex decision processes. PhD thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 1998.

Abstract

Abstract is missing.