VIMA: Robot Manipulation with Multimodal Prompts

Yunfan Jiang, Agrim Gupta, Zichen Zhang, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei, Anima Anandkumar, Yuke Zhu, Linxi Fan. VIMA: Robot Manipulation with Multimodal Prompts. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 14975-15022, PMLR, 2023.
