Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management

Shipra Agrawal, Randy Jia. Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management. In Anna Karlin, Nicole Immorlica, Ramesh Johari, editors, Proceedings of the 2019 ACM Conference on Economics and Computation, EC 2019, Phoenix, AZ, USA, June 24-28, 2019. Pages 743-744, ACM, 2019.
