Aviral Kumar, Rishabh Agarwal, Tengyu Ma 0001, Aaron C. Courville, George Tucker, Sergey Levine. DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]
Abstract is missing.