Shangdong Yang, Huihui Wang, Shaokang Dong, Xingguo Chen. Leveraging transition exploratory bonus for efficient exploration in Hard-Transiting reinforcement learning problems. Future Generation Comp. Syst., 145:442-453, August 2023.