Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

Lei Wang 0185, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim. Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites. In Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson 0001, Bei Liu, Yoko Yamakata, editors, MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part IV. Volume 14557 of Lecture Notes in Computer Science, pages 32-45, Springer, 2024. [doi]

Authors

Lei Wang 0185

This author has not been identified. Look up 'Lei Wang 0185' in Google

Jiabang He

This author has not been identified. Look up 'Jiabang He' in Google

Shenshen Li

This author has not been identified. Look up 'Shenshen Li' in Google

Ning Liu

This author has not been identified. Look up 'Ning Liu' in Google

Ee-Peng Lim

This author has not been identified. It may be one of the following persons: Look up 'Ee-Peng Lim' in Google