Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

Kate Rakelly, Aurick Zhou, Chelsea Finn, Sergey Levine, Deirdre Quillen. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 5331-5340, PMLR, 2019. [doi]

@inproceedings{RakellyZFLQ19,
  title = {Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables},
  author = {Kate Rakelly and Aurick Zhou and Chelsea Finn and Sergey Levine and Deirdre Quillen},
  year = {2019},
  url = {http://proceedings.mlr.press/v97/rakelly19a.html},
  researchr = {https://researchr.org/publication/RakellyZFLQ19},
  cites = {0},
  citedby = {0},
  pages = {5331-5340},
  booktitle = {Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}