An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives

Shipra Agrawal, Nikhil R. Devanur, Lihong Li. An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives. In Vitaly Feldman, Alexander Rakhlin, Ohad Shamir, editors, Proceedings of the 29th Conference on Learning Theory, COLT 2016, New York, USA, June 23-26, 2016. Volume 49 of JMLR Workshop and Conference Proceedings, pages 4-18, JMLR.org, 2016.

Authors

Shipra Agrawal

Nikhil R. Devanur

Lihong Li