A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

David Vengerov. A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments. Future Generation Comp. Syst., 24(7):687-693, 2008. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.