Peter Vamplew, Richard Dazeley, Ewan Barker, Andrei Kelarev. Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks. In Ann E. Nicholson, Xiaodong Li, editors, AI 2009: Advances in Artificial Intelligence, 22nd Australasian Joint Conference, Melbourne, Australia, December 1-4, 2009. Proceedings. Volume 5866 of Lecture Notes in Computer Science, pages 340-349, Springer, 2009. [doi]
Abstract is missing.