Dual Policy Iteration

Wen Sun 0002, Geoffrey J. Gordon, Byron Boots, J. Andrew Bagnell. Dual Policy Iteration. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, Roman Garnett, editors, Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada. pages 7059-7069, 2018. [doi]

@inproceedings{0002GBB18,
  title = {Dual Policy Iteration},
  author = {Wen Sun 0002 and Geoffrey J. Gordon and Byron Boots and J. Andrew Bagnell},
  year = {2018},
  url = {http://papers.nips.cc/paper/7937-dual-policy-iteration},
  researchr = {https://researchr.org/publication/0002GBB18},
  cites = {0},
  citedby = {0},
  pages = {7059-7069},
  booktitle = {Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada},
  editor = {Samy Bengio and Hanna M. Wallach and Hugo Larochelle and Kristen Grauman and Nicolò Cesa-Bianchi and Roman Garnett},
}