VIMA: Robot Manipulation with Multimodal Prompts

Yunfan Jiang, Agrim Gupta, Zichen Zhang 0011, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei 0001, Anima Anandkumar, Yuke Zhu, Linxi Fan. VIMA: Robot Manipulation with Multimodal Prompts. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 14975-15022, PMLR, 2023. [doi]

Authors

Yunfan Jiang

This author has not been identified. Look up 'Yunfan Jiang' in Google

Agrim Gupta

This author has not been identified. Look up 'Agrim Gupta' in Google

Zichen Zhang 0011

This author has not been identified. Look up 'Zichen Zhang 0011' in Google

Guanzhi Wang

This author has not been identified. Look up 'Guanzhi Wang' in Google

Yongqiang Dou

This author has not been identified. Look up 'Yongqiang Dou' in Google

Yanjun Chen

This author has not been identified. Look up 'Yanjun Chen' in Google

Li Fei-Fei 0001

This author has not been identified. Look up 'Li Fei-Fei 0001' in Google

Anima Anandkumar

This author has not been identified. Look up 'Anima Anandkumar' in Google

Yuke Zhu

This author has not been identified. Look up 'Yuke Zhu' in Google

Linxi Fan

This author has not been identified. Look up 'Linxi Fan' in Google