Peeling Back the Layers: Interpreting the Storytelling of ViT

Jingjie Zeng, Zhihao Yang, Qi Yang, Liang Yang 0003, Hongfei Lin. Peeling Back the Layers: Interpreting the Storytelling of ViT. In Jianfei Cai 0001, Mohan S. Kankanhalli, Balakrishnan Prabhakaran 0001, Susanne Boll, Ramanathan Subramanian, Liang Zheng 0001, Vivek K. Singh 0001, Pablo César, Lexing Xie, Dong Xu 0001, editors, Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November 2024. pages 7298-7306, ACM, 2024. [doi]

Abstract

Abstract is missing.