Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting

Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins, Chicheng Zhang. Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting. Journal of Machine Learning Research, 21, 2020.

@article{Krishnamurthy0S20,
  title = {Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting},
  author = {Akshay Krishnamurthy and John Langford and Aleksandrs Slivkins and Chicheng Zhang},
  year = {2020},
  url = {http://jmlr.org/papers/v21/19-650.html},
  researchr = {https://researchr.org/publication/Krishnamurthy0S20},
  cites = {0},
  citedby = {0},
  journal = {Journal of Machine Learning Research},
  volume = {21},
}