Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management

Shipra Agrawal 0001, Randy Jia. Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management. In Anna Karlin, Nicole Immorlica, Ramesh Johari, editors, Proceedings of the 2019 ACM Conference on Economics and Computation, EC 2019, Phoenix, AZ, USA, June 24-28, 2019. pages 743-744, ACM, 2019. [doi]

Authors

Shipra Agrawal 0001

This author has not been identified. Look up 'Shipra Agrawal 0001' in Google

Randy Jia

This author has not been identified. Look up 'Randy Jia' in Google