Performance Investigation of UCB Policy in Q-learning

Koki Saito, Akira Notsu, Seiki Ubukata, Katsuhiro Honda. Performance Investigation of UCB Policy in Q-learning. In 14th IEEE International Conference on Machine Learning and Applications, ICMLA 2015, Miami, FL, USA, December 9-11, 2015. pages 777-780, IEEE, 2015.

Authors

Koki Saito

Akira Notsu

Seiki Ubukata

Katsuhiro Honda
