Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Sign up for an account to create a profile with publication list, tag and review your related work, and share bibliographies with your co-authors.
Gen Li 0005, Yuting Wei, Yuejie Chi, Yuxin Chen 0002. Softmax policy gradient methods can take exponential time to converge. Math. Program., 201(1):707-802, 2023. [doi]
Possibly Related PublicationsThe following publications are possibly variants of this publication: Softmax Policy Gradient Methods Can Take Exponential Time to ConvergeGen Li 0005, Yuting Wei, Yuejie Chi, Yuantao Gu, Yuxin Chen 0002. colt 2021: 3107-3110 [doi]
The following publications are possibly variants of this publication: