Dynamic Hierarchical Token Merging for Vision Transformers

Karim Haroun, Thibault Allenet, Karim Ben Chehida, Jean Martinet. Dynamic Hierarchical Token Merging for Vision Transformers. In Thomas Bashford-Rogers, Daniel Meneveaux, Mehdi Ammi, Mounia Ziat, Stefan Jänicke, Helen C. Purchase, Petia Radeva, Antonino Furnari, Kadi Bouatouch, A. Augusto de Sousa, editors, Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2025 - Volume 3: VISAPP, Porto, Portugal, February 26-28, 2025. pages 677-684, SCITEPRESS, 2025. [doi]

Abstract

Abstract is missing.