Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding

Chuyang Zhao, Yuxin Song, Junru Chen, Kang Rong, Haocheng Feng, Gang Zhang, Shufan Ji, Jingdong Wang 0001, Errui Ding, Yifan Sun 0003. Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024. [doi]

Authors

Chuyang Zhao

This author has not been identified. Look up 'Chuyang Zhao' in Google

Yuxin Song

This author has not been identified. Look up 'Yuxin Song' in Google

Junru Chen

This author has not been identified. Look up 'Junru Chen' in Google

Kang Rong

This author has not been identified. Look up 'Kang Rong' in Google

Haocheng Feng

This author has not been identified. Look up 'Haocheng Feng' in Google

Gang Zhang

This author has not been identified. Look up 'Gang Zhang' in Google

Shufan Ji

This author has not been identified. Look up 'Shufan Ji' in Google

Jingdong Wang 0001

This author has not been identified. Look up 'Jingdong Wang 0001' in Google

Errui Ding

This author has not been identified. Look up 'Errui Ding' in Google

Yifan Sun 0003

This author has not been identified. Look up 'Yifan Sun 0003' in Google