Online Learning with Implicit Exploration in Episodic Markov Decision Processes

Mahsa Ghasemi, Abolfazl Hashemi, Haris Vikalo, Ufuk Topcu. Online Learning with Implicit Exploration in Episodic Markov Decision Processes. In 2021 American Control Conference, ACC 2021, New Orleans, LA, USA, May 25-28, 2021. pages 1953-1958, IEEE, 2021. [doi]

@inproceedings{GhasemiHVT21,
  title = {Online Learning with Implicit Exploration in Episodic Markov Decision Processes},
  author = {Mahsa Ghasemi and Abolfazl Hashemi and Haris Vikalo and Ufuk Topcu},
  year = {2021},
  doi = {10.23919/ACC50511.2021.9483085},
  url = {https://doi.org/10.23919/ACC50511.2021.9483085},
  researchr = {https://researchr.org/publication/GhasemiHVT21},
  cites = {0},
  citedby = {0},
  pages = {1953-1958},
  booktitle = {2021 American Control Conference, ACC 2021, New Orleans, LA, USA, May 25-28, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-4197-1},
}