B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang. B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis. In The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024. OpenReview.net, 2024. [doi]

Authors

Zishun Yu

This author has not been identified. Look up 'Zishun Yu' in Google

Yunzhe Tao

This author has not been identified. Look up 'Yunzhe Tao' in Google

Liyu Chen

This author has not been identified. Look up 'Liyu Chen' in Google

Tao Sun

This author has not been identified. Look up 'Tao Sun' in Google

Hongxia Yang

This author has not been identified. Look up 'Hongxia Yang' in Google