Online Learning in Weakly Coupled Markov Decision Processes: A Convergence Time Study

Xiaohan Wei, Hao Yu 0002, Michael J. Neely. Online Learning in Weakly Coupled Markov Decision Processes: A Convergence Time Study. POMACS, 2(1), 2018. [doi]

@article{WeiYN18,
  title = {Online Learning in Weakly Coupled Markov Decision Processes: A Convergence Time Study},
  author = {Xiaohan Wei and Hao Yu 0002 and Michael J. Neely},
  year = {2018},
  doi = {10.1145/3179415},
  url = {http://doi.acm.org/10.1145/3179415},
  researchr = {https://researchr.org/publication/WeiYN18},
  cites = {0},
  citedby = {0},
  journal = {POMACS},
  volume = {2},
  number = {1},
}