Improving Policy Gradient Estimates with Influence Information

Jervis Pinto, Alan Fern, Tim Bauer, Martin Erwig. Improving Policy Gradient Estimates with Influence Information. Journal of Machine Learning Research, 20:1-18, 2011. [doi]

Authors

Jervis Pinto

This author has not been identified. Look up 'Jervis Pinto' in Google

Alan Fern

This author has not been identified. Look up 'Alan Fern' in Google

Tim Bauer

This author has not been identified. Look up 'Tim Bauer' in Google

Martin Erwig

This author has not been identified. It may be one of the following persons: Look up 'Martin Erwig' in Google