Rethinking the Mixture of Vision Encoders Paradigm for Enhanced Visual Understanding in Multimodal LLMs

Mozhgan Nasr Azadani, James Riddell, Sean Sedwards, Krzysztof Czarnecki 0001. Rethinking the Mixture of Vision Encoders Paradigm for Enhanced Visual Understanding in Multimodal LLMs. Trans. Mach. Learn. Res., 2026, 2026. [doi]

Authors

Mozhgan Nasr Azadani

This author has not been identified. Look up 'Mozhgan Nasr Azadani' in Google

James Riddell

This author has not been identified. Look up 'James Riddell' in Google

Sean Sedwards

This author has not been identified. Look up 'Sean Sedwards' in Google

Krzysztof Czarnecki 0001

This author has not been identified. Look up 'Krzysztof Czarnecki 0001' in Google