Using chains of bottleneck transitions to decompose and solve reinforcement learning tasks with hidden states

Hüseyin Aydin, Erkin Çilden, Faruk Polat. Using chains of bottleneck transitions to decompose and solve reinforcement learning tasks with hidden states. Future Generation Comp. Syst., 133:153-168, 2022.
