Online fitted policy iteration based on extreme learning machines

Pablo Escandell-Montero, Delia Lorente, José María Martínez-Martínez, Emilio Soria-Olivas, Joan Vila-Francés, José D. Martín-Guerrero. Online fitted policy iteration based on extreme learning machines. Knowl.-Based Syst., 100:200-211, 2016.

@article{Escandell-Montero16,
  title = {Online fitted policy iteration based on extreme learning machines},
  author = {Pablo Escandell-Montero and Delia Lorente and José María Martínez-Martínez and Emilio Soria-Olivas and Joan Vila-Francés and José D. Martín-Guerrero},
  year = {2016},
  doi = {10.1016/j.knosys.2016.03.007},
  url = {http://dx.doi.org/10.1016/j.knosys.2016.03.007},
  researchr = {https://researchr.org/publication/Escandell-Montero16},
  cites = {0},
  citedby = {0},
  journal = {Knowl.-Based Syst.},
  volume = {100},
  pages = {200--211},
}