AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao. AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition. In Kevin Duh, Helena Gómez-Adorno, Steven Bethard, editors, Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), NAACL 2024, Mexico City, Mexico, June 16-21, 2024. pages 1346-1362, Association for Computational Linguistics, 2024. [doi]

@inproceedings{ChenZZZLRY24,
  title = {AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition},
  author = {Zhaorun Chen and Zhuokai Zhao and Zhihong Zhu and Ruiqi Zhang and Xiang Li and Bhiksha Raj and Huaxiu Yao},
  year = {2024},
  doi = {10.18653/v1/2024.naacl-long.73},
  url = {https://doi.org/10.18653/v1/2024.naacl-long.73},
  researchr = {https://researchr.org/publication/ChenZZZLRY24},
  cites = {0},
  citedby = {0},
  pages = {1346-1362},
  booktitle = {Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), NAACL 2024, Mexico City, Mexico, June 16-21, 2024},
  editor = {Kevin Duh and Helena Gómez-Adorno and Steven Bethard},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-114-8},
}