Defying Distractions in Multimodal Tasks: A Novel Benchmark for Large Vision-Language Models - researchr publication

researchr

You are not signed in
Sign in
Sign up

Jinhui Yang, Ming Jiang 0019, Qi Zhao 0001. Defying Distractions in Multimodal Tasks: A Novel Benchmark for Large Vision-Language Models. IEEE Trans. Pattern Anal. Mach. Intell., 48(6):6314-6331, June 2026. [doi]

Abstract is missing.

runs on WebDSL