STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Zijun Wang, Haoqin Tu, Yuhan Wang 0001, Juncheng Wu, Yanqing Liu, Jieru Mei, Brian R. Bartoldson, Bhavya Kailkhura, Cihang Xie. STAR-1: Safer Alignment of Reasoning LLMs with 1K Data. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. Pages 37988-37997, AAAI Press, 2026.

