Policy teaching through reward function learning

Haoqi Zhang, David C. Parkes, Yiling Chen. Policy teaching through reward function learning. In John Chuang, Lance Fortnow, Pearl Pu, editors, Proceedings 10th ACM Conference on Electronic Commerce (EC-2009), Stanford, California, USA, July 6--10, 2009. pages 295-304, ACM, 2009. [doi]

Authors

Haoqi Zhang

This author has not been identified. Look up 'Haoqi Zhang' in Google

David C. Parkes

This author has not been identified. Look up 'David C. Parkes' in Google

Yiling Chen

This author has not been identified. Look up 'Yiling Chen' in Google