When Large Vision Language Models Meet Multimodal Sequential Recommendation: An Empirical Study

Peilin Zhou, Chao Liu 0001, Jing Ren, Xinfeng Zhou, Yueqi Xie, Meng Cao, Zhongtao Rao, You-Liang Huang, Dading Chong, Junling Liu, Jae Boum Kim, Shoujin Wang, Raymond Chi-Wing Wong, Sunghun Kim 0001. When Large Vision Language Models Meet Multimodal Sequential Recommendation: An Empirical Study. In Guodong Long, Michale Blumestein, Yi Chang 0001, Liane Lewin-Eytan, Zi Helen Huang, Elad Yom-Tov, editors, Proceedings of the ACM on Web Conference 2025, WWW 2025, Sydney, NSW, Australia, 28 April 2025- 2 May 2025. pages 275-292, ACM, 2025. [doi]

@inproceedings{ZhouLRZXCRHCLKW25,
  title = {When Large Vision Language Models Meet Multimodal Sequential Recommendation: An Empirical Study},
  author = {Peilin Zhou and Chao Liu 0001 and Jing Ren and Xinfeng Zhou and Yueqi Xie and Meng Cao and Zhongtao Rao and You-Liang Huang and Dading Chong and Junling Liu and Jae Boum Kim and Shoujin Wang and Raymond Chi-Wing Wong and Sunghun Kim 0001},
  year = {2025},
  doi = {10.1145/3696410.3714764},
  url = {https://doi.org/10.1145/3696410.3714764},
  researchr = {https://researchr.org/publication/ZhouLRZXCRHCLKW25},
  cites = {0},
  citedby = {0},
  pages = {275-292},
  booktitle = {Proceedings of the ACM on Web Conference 2025, WWW 2025, Sydney, NSW, Australia, 28 April 2025- 2 May 2025},
  editor = {Guodong Long and Michale Blumestein and Yi Chang 0001 and Liane Lewin-Eytan and Zi Helen Huang and Elad Yom-Tov},
  publisher = {ACM},
  isbn = {979-8-4007-1274-6},
}