VLA-Grasp: a vision-language-action modeling with cross-modality fusion for task-oriented grasping

Jianwei Zhu, Xueying Sun, Qiang Zhang 0044, Mingmin Liu. VLA-Grasp: a vision-language-action modeling with cross-modality fusion for task-oriented grasping. Complex Intell. Syst., 11(6), 2025. [doi]

Abstract

Abstract is missing.