MambaVLA: A Scalable and Efficient Vision-Language-Action Model with State Space Architecture

Sai Navaneet Peddapalli, Manisha Lingala, Sangmoon Lee, Ju H. Park. MambaVLA: A Scalable and Efficient Vision-Language-Action Model with State Space Architecture. In 23rd Consumer Communications & Networking Conference, CCNC 2026, Las Vegas, NV, USA, January 9-12, 2026. pages 1-4, IEEE, 2026. [doi]

Abstract

Abstract is missing.