Adaptive Batch Size for Safe Policy Gradients

Matteo Papini, Matteo Pirotta, Marcello Restelli. Adaptive Batch Size for Safe Policy Gradients. In Isabelle Guyon, Ulrike von Luxburg, Samy Bengio, Hanna M. Wallach, Rob Fergus, S. V. N. Vishwanathan, Roman Garnett, editors, Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA. pages 3594-3603, 2017. [doi]

@inproceedings{PapiniPR17,
  title = {Adaptive Batch Size for Safe Policy Gradients},
  author = {Matteo Papini and Matteo Pirotta and Marcello Restelli},
  year = {2017},
  url = {http://papers.nips.cc/paper/6950-adaptive-batch-size-for-safe-policy-gradients},
  researchr = {https://researchr.org/publication/PapiniPR17},
  cites = {0},
  citedby = {0},
  pages = {3594-3603},
  booktitle = {Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA},
  editor = {Isabelle Guyon and Ulrike von Luxburg and Samy Bengio and Hanna M. Wallach and Rob Fergus and S. V. N. Vishwanathan and Roman Garnett},
}