Mathis Le Bail, Jérémie Dentan, Davide Buscaldi, Sonia Vanier. Unveiling Decision-Making in LLMs for Text Classification: Extraction of influential and interpretable concepts with Sparse Autoencoders. In Vera Demberg, Kentaro Inui, Lluís Marquez, editors, Findings of the Association for Computational Linguistics: EACL 2026, Rabat, Morocco, March 24-29, 2026, pages 2477-2504. Association for Computational Linguistics, 2026.