Linear Steerability in Language Models: When It Emerges and How It Evolves

Jianshu She, Xinyue Li, Eric P. Xing, Zhengzhong Liu, Qirong Ho. Linear Steerability in Language Models: When It Emerges and How It Evolves. In Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng, editors, Findings of the Association for Computational Linguistics: EMNLP 2025, Suzhou, China, November 4-9, 2025, pages 17821-17846. Association for Computational Linguistics, 2025.
