Fast reinforcement learning with generalized policy updates

André Barreto, Shaobo Hou, Diana Borsa, David Silver, Doina Precup. Fast reinforcement learning with generalized policy updates. Proc. Natl. Acad. Sci. USA, 117(48):30079-30087, 2020. [doi]

Abstract

Abstract is missing.