Multi-objective reinforcement learning method for acquiring all Pareto optimal policies simultaneously

Yusuke Mukai, Yasuaki Kuroe, Hitoshi Iima. Multi-objective reinforcement learning method for acquiring all Pareto optimal policies simultaneously. In Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, SMC 2012, Seoul, Korea (South), October 14-17, 2012. Pages 1917-1923, IEEE, 2012.

Authors

Yusuke Mukai

Yasuaki Kuroe

Hitoshi Iima
