Worst Cases Policy Gradients

Yichuan Charlie Tang, Jian Zhang, Ruslan Salakhutdinov. Worst Cases Policy Gradients. In Leslie Pack Kaelbling, Danica Kragic, Komei Sugiura, editors, 3rd Annual Conference on Robot Learning, CoRL 2019, Osaka, Japan, October 30 - November 1, 2019, Proceedings. Volume 100 of Proceedings of Machine Learning Research, pages 1078-1093, PMLR, 2019. [doi]

@inproceedings{TangZS19,
  title = {Worst Cases Policy Gradients},
  author = {Yichuan Charlie Tang and Jian Zhang and Ruslan Salakhutdinov},
  year = {2019},
  url = {http://proceedings.mlr.press/v100/tang20a.html},
  researchr = {https://researchr.org/publication/TangZS19},
  cites = {0},
  citedby = {0},
  pages = {1078-1093},
  booktitle = {3rd Annual Conference on Robot Learning, CoRL 2019, Osaka, Japan, October 30 - November 1, 2019, Proceedings},
  editor = {Leslie Pack Kaelbling and Danica Kragic and Komei Sugiura},
  volume = {100},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}