Smoothed Action Value Functions for Learning Gaussian Policies

Ofir Nachum, Mohammad Norouzi 0002, George Tucker, Dale Schuurmans. Smoothed Action Value Functions for Learning Gaussian Policies. In Jennifer G. Dy, Andreas Krause 0001, editors, Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018. Volume 80 of JMLR Workshop and Conference Proceedings, pages 3689-3697, JMLR.org, 2018. [doi]

@inproceedings{Nachum0TS18,
  title = {Smoothed Action Value Functions for Learning Gaussian Policies},
  author = {Ofir Nachum and Mohammad Norouzi 0002 and George Tucker and Dale Schuurmans},
  year = {2018},
  url = {http://proceedings.mlr.press/v80/nachum18a.html},
  researchr = {https://researchr.org/publication/Nachum0TS18},
  cites = {0},
  citedby = {0},
  pages = {3689-3697},
  booktitle = {Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018},
  editor = {Jennifer G. Dy and Andreas Krause 0001},
  volume = {80},
  series = {JMLR Workshop and Conference Proceedings},
  publisher = {JMLR.org},
}