Researchr is a web site for finding, collecting, sharing, and reviewing scientific publications, for researchers by researchers.
Sign up for an account to create a profile with publication list, tag and review your related work, and share bibliographies with your co-authors.
Xinning Zhu, Jinxin Du, Longfei Huang, Lunde Chen. DyCoT-RE: Chain-of-Thought-enhanced LLM reward engineering with dual-dynamic optimization for reinforcement learning. Neurocomputing, 695:133945, 2026. [doi]
Abstract is missing.