Whose Values Prevail? Bias in Large Language Model Value Alignment

Ruoxi Qi, Gleb Papyshev, Kellee Tsai, Antoni B. Chan, Janet H. Hsiao. Whose Values Prevail? Bias in Large Language Model Value Alignment. In David Barner, Neil Bramley, Azzurra Ruggeri, Caren M. Walker, editors, Proceedings of the 47th Annual Meeting of the Cognitive Science Society, CogSci 2025, San Francisco, CA, USA, July 30 - August 2, 2025. cognitivesciencesociety.org, 2025.

Abstract

Abstract is missing.