Xing Chen, Yewen Li, Xiaofeng Cao, Hechang Chen, Hengshuai Yao, Bo An, Yi Chang. Universal Stabilization for Maximum Entropy Optimization in Reinforcement Learning. IEEE Transactions on Neural Networks, 37(4):1851-1863, April 2026.