Understanding Large Language Model Behaviors Through Interactive Counterfactual Generation and Analysis

Furui Cheng, Vilém Zouhar, Robin Shing Moon Chan, Daniel Fürst, Hendrik Strobelt, Mennatallah El-Assady. Understanding Large Language Model Behaviors Through Interactive Counterfactual Generation and Analysis. IEEE Trans. Vis. Comput. Graph., 32(1):846-856, January 2026. [doi]

Abstract

Abstract is missing.