Balanced Q-learning: Combining the influence of optimistic and pessimistic targets

Thommen George Karimpanal, Hung Le, Majid Abdolshah, Santu Rana, Sunil Gupta 0001, Truyen Tran 0001, Svetha Venkatesh. Balanced Q-learning: Combining the influence of optimistic and pessimistic targets. Artificial Intelligence, 325:104021, December 2023. [doi]

Abstract

Abstract is missing.