DT4LM: Differential Testing for Reliable Language Model Updates in Classification Tasks

Xinyue Zuo, Yan Xiao, Xiaochun Cao, Wenya Wang, Jin Song Dong. DT4LM: Differential Testing for Reliable Language Model Updates in Classification Tasks. IEEE Trans. Software Eng., 51(12):3558-3573, December 2025.
