Yohei Takamori, Yuko Osana. Profit sharing that can learn deterministic policy for POMDPs environments. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Anchorage, Alaska, USA, October 9-12, 2011. pages 490-495, IEEE, 2011. [doi]