Training Task Reasoning LLM Agents for Multi-Turn Task Planning via Single-Turn Reinforcement Learning

Hanjiang Hu, Changliu Liu, Na Li 0002, Yebin Wang. Training Task Reasoning LLM Agents for Multi-Turn Task Planning via Single-Turn Reinforcement Learning. IEEE Control Systems Letters, 9:2879-2884, 2025. [doi]

Authors

Hanjiang Hu

This author has not been identified. Look up 'Hanjiang Hu' in Google

Changliu Liu

This author has not been identified. Look up 'Changliu Liu' in Google

Na Li 0002

This author has not been identified. Look up 'Na Li 0002' in Google

Yebin Wang

This author has not been identified. Look up 'Yebin Wang' in Google