Balanced Q-learning: Combining the influence of optimistic and pessimistic targets

Thommen George Karimpanal, Hung Le, Majid Abdolshah, Santu Rana, Sunil Gupta 0001, Truyen Tran 0001, Svetha Venkatesh. Balanced Q-learning: Combining the influence of optimistic and pessimistic targets. Artificial Intelligence, 325:104021, December 2023. [doi]

@article{KarimpanalLARGTV23,
  title = {Balanced Q-learning: Combining the influence of optimistic and pessimistic targets},
  author = {Thommen George Karimpanal and Hung Le and Majid Abdolshah and Santu Rana and Sunil Gupta 0001 and Truyen Tran 0001 and Svetha Venkatesh},
  year = {2023},
  month = {December},
  doi = {10.1016/j.artint.2023.104021},
  url = {https://doi.org/10.1016/j.artint.2023.104021},
  researchr = {https://researchr.org/publication/KarimpanalLARGTV23},
  cites = {0},
  citedby = {0},
  journal = {Artificial Intelligence},
  volume = {325},
  pages = {104021},
}