Enhancing Interpretability and Gaining Insights into Robustness in Vision-Language Models Through Core and Spurious Feature Detection via Counterfactuals

Anjon Basak, Adrienne Raglin. Enhancing Interpretability and Gaining Insights into Robustness in Vision-Language Models Through Core and Spurious Feature Detection via Counterfactuals. In Helmut Degen, Stavroula Ntoa, editors, HCI International 2025 - Late Breaking Papers - 27th International Conference on Human-Computer Interaction, HCII 2025, Gothenburg, Sweden, June 22-27, 2025, Proceedings, Part XV. Volume 16345 of Lecture Notes in Computer Science, pages 133-149, Springer, 2025.

Abstract

Abstract is missing.