Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks

Peter Vamplew, Richard Dazeley, Ewan Barker, Andrei Kelarev. Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks. In Ann E. Nicholson, Xiaodong Li, editors, AI 2009: Advances in Artificial Intelligence, 22nd Australasian Joint Conference, Melbourne, Australia, December 1-4, 2009. Proceedings. Volume 5866 of Lecture Notes in Computer Science, pages 340-349, Springer, 2009. [doi]

Authors

Peter Vamplew

This author has not been identified. Look up 'Peter Vamplew' in Google

Richard Dazeley

This author has not been identified. Look up 'Richard Dazeley' in Google

Ewan Barker

This author has not been identified. Look up 'Ewan Barker' in Google

Andrei Kelarev

This author has not been identified. Look up 'Andrei Kelarev' in Google