Profit sharing that can learn deterministic policy for POMDPs environments

Yohei Takamori, Yuko Osana. Profit sharing that can learn deterministic policy for POMDPs environments. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Anchorage, Alaska, USA, October 9-12, 2011. pages 490-495, IEEE, 2011. [doi]

Authors

Yohei Takamori

This author has not been identified. Look up 'Yohei Takamori' in Google

Yuko Osana

This author has not been identified. Look up 'Yuko Osana' in Google