Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships

Futa Waseda, Antonio Tejero-de-Pablos, Isao Echizen. Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2026, Tucson, AZ, USA, March 6-10, 2026. pages 6968-6977, IEEE, 2026. [doi]

Abstract

Abstract is missing.