Evaluating Concept Discovery Methods for Sensitive Attributes in Language Models

Sarah Schröder, Alexander Schulz 0001, Barbara Hammer. Evaluating Concept Discovery Methods for Sensitive Attributes in Language Models. In 33rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2025, Bruges, Belgium, April 23-25, 2025. 2025. [doi]

Abstract

Abstract is missing.