Chunjiang He, Gang Yang. DiffSynth-LVOS: Enhancing Language-Guided Video Object Segmentation via Diffusion-Based Synthetic Data Generation. In Jakub Lokoc, Ladislav Peska, Jan Zahálka, Stevan Rudinac, Marc A. Kastner 0001, Jingjing Chen, Min-Chun Hu 0001, Jiaxin Wu 0001, Ujjwal Sharma 0001, editors, MultiMedia Modeling - 32nd International Conference on Multimedia Modeling, MMM 2026, Prague, Czech Republic, January 29-31, 2026, Proceedings, Part I. Volume 16412 of Lecture Notes in Computer Science, pages 595-608, Springer, 2026. [doi]
Abstract is missing.