Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective

Zubair Bashir, Bhavik Chandna, Procheta Sen. Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective. Trans. Mach. Learn. Res., 2025, 2025.

Authors

Zubair Bashir

Bhavik Chandna

Procheta Sen
