TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints

Vinh-Thuan Ly, Hoang M. Truong, Xuan-Huong Nguyen. TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints. In IEEE/CVF International Conference on Computer Vision, ICCV 2025 - Workshops, Honolulu, HI, USA, October 19-20, 2025. pages 5508-5515, IEEE, 2025. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.