Infinite time horizon maximum causal entropy inverse reinforcement learning

Michael Bloem, Nicholas Bambos. Infinite time horizon maximum causal entropy inverse reinforcement learning. In 53rd IEEE Conference on Decision and Control, CDC 2014, Los Angeles, CA, USA, December 15-17, 2014, pages 4911-4916. IEEE, 2014.

Authors

Michael Bloem

Nicholas Bambos
