Towards Variance Reduction for Reinforcement Learning of Industrial Decision-making Tasks: A Bi-Critic based Demand-Constraint Decoupling Approach

Jianyong Yuan 0032, Jiayi Zhang, Zinuo Cai, Junchi Yan. Towards Variance Reduction for Reinforcement Learning of Industrial Decision-making Tasks: A Bi-Critic based Demand-Constraint Decoupling Approach. In Ambuj Singh, Yizhou Sun, Leman Akoglu, Dimitrios Gunopulos, Xifeng Yan, Ravi Kumar 0001, Fatma Ozcan, Jieping Ye, editors, Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, CA, USA, August 6-10, 2023. pages 3162-3172, ACM, 2023. [doi]

Authors

Jianyong Yuan 0032

This author has not been identified. Look up 'Jianyong Yuan 0032' in Google

Jiayi Zhang

This author has not been identified. Look up 'Jiayi Zhang' in Google

Zinuo Cai

This author has not been identified. Look up 'Zinuo Cai' in Google

Junchi Yan

This author has not been identified. Look up 'Junchi Yan' in Google