Profit sharing that can learn deterministic policy for POMDPs environments

Yohei Takamori, Yuko Osana. Profit sharing that can learn deterministic policy for POMDPs environments. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Anchorage, Alaska, USA, October 9-12, 2011. pages 490-495, IEEE, 2011.
