Revisiting the Softmax Bellman Operator: New Benefits and New Perspective

Zhao Song, Ronald Parr, Lawrence Carin. Revisiting the Softmax Bellman Operator: New Benefits and New Perspective. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 5916-5925, PMLR, 2019.

@inproceedings{SongPC19,
  title = {Revisiting the Softmax Bellman Operator: New Benefits and New Perspective},
  author = {Zhao Song and Ronald Parr and Lawrence Carin},
  year = {2019},
  url = {http://proceedings.mlr.press/v97/song19c.html},
  researchr = {https://researchr.org/publication/SongPC19},
  pages = {5916--5925},
  booktitle = {Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}