Two-phase selective decentralization to improve reinforcement learning systems with MDP

Thanh Nguyen, Snehasis Mukhopadhyay. Two-phase selective decentralization to improve reinforcement learning systems with MDP. AI Commun., 31(4):319-337, 2018.

@article{NguyenM18a,
  title = {Two-phase selective decentralization to improve reinforcement learning systems with {MDP}},
  author = {Thanh Nguyen and Snehasis Mukhopadhyay},
  year = {2018},
  doi = {10.3233/AIC-180766},
  url = {https://doi.org/10.3233/AIC-180766},
  researchr = {https://researchr.org/publication/NguyenM18a},
  cites = {0},
  citedby = {0},
  journal = {AI Communications},
  volume = {31},
  number = {4},
  pages = {319--337},
}