The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1

Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao, Shreedhar Jangam, Jayanth Srinivasa, Gaowen Liu, Dawn Song, Xin Eric Wang. The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1. In Kentaro Inui, Sakriani Sakti, Haofen Wang, Derek F. Wong, Pushpak Bhattacharyya, Biplab Banerjee, Asif Ekbal, Tanmoy Chakraborty, Dhirendra Pratap Singh, editors, Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, IJCNLP-AACL 2025, Mumbai, India, December 20-24, 2025, pages 3250-3265. The Asian Federation of Natural Language Processing and The Association for Computational Linguistics, 2025.

Abstract

Abstract is missing.