Profit sharing that can learn deterministic policy for POMDPs environments

Yohei Takamori, Yuko Osana. Profit sharing that can learn deterministic policy for POMDPs environments. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Anchorage, Alaska, USA, October 9-12, 2011. Pages 490-495, IEEE, 2011.
