TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints

Vinh-Thuan Ly, Hoang M. Truong, Xuan-Huong Nguyen. TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints. In IEEE/CVF International Conference on Computer Vision, ICCV 2025 - Workshops, Honolulu, HI, USA, October 19-20, 2025. pages 5508-5515, IEEE, 2025. [doi]

@inproceedings{LyTN25,
  title = {TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints},
  author = {Vinh-Thuan Ly and Hoang M. Truong and Xuan-Huong Nguyen},
  year = {2025},
  doi = {10.1109/ICCVW69036.2025.00577},
  url = {https://doi.org/10.1109/ICCVW69036.2025.00577},
  researchr = {https://researchr.org/publication/LyTN25},
  cites = {0},
  citedby = {0},
  pages = {5508-5515},
  booktitle = {IEEE/CVF International Conference on Computer Vision, ICCV 2025 - Workshops, Honolulu, HI, USA, October 19-20, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-8988-2},
}