Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation

Honghao Wei, Xin Liu, Lei Ying. Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation. In Gustau Camps-Valls, Francisco J. R. Ruiz, Isabel Valera, editors, International Conference on Artificial Intelligence and Statistics, AISTATS 2022, 28-30 March 2022, Virtual Event. Volume 151 of Proceedings of Machine Learning Research, pages 3274-3307, PMLR, 2022.

@inproceedings{WeiLY22,
  title = {Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation},
  author = {Honghao Wei and Xin Liu and Lei Ying},
  year = {2022},
  url = {https://proceedings.mlr.press/v151/wei22a.html},
  researchr = {https://researchr.org/publication/WeiLY22},
  cites = {0},
  citedby = {0},
  pages = {3274--3307},
  booktitle = {International Conference on Artificial Intelligence and Statistics, AISTATS 2022, 28-30 March 2022, Virtual Event},
  editor = {Gustau Camps-Valls and Francisco J. R. Ruiz and Isabel Valera},
  volume = {151},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}