A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

David Vengerov. A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments. Future Generation Comp. Syst., 24(7):687-693, 2008.

Abstract

Abstract is missing.