RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning

Xiaojian Ma, Weili Nie, Zhiding Yu, Huaizu Jiang, Chaowei Xiao, Yuke Zhu, Song Chun Zhu, Anima Anandkumar. RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

Authors

Xiaojian Ma

This author has not been identified. Look up 'Xiaojian Ma' in Google

Weili Nie

This author has not been identified. Look up 'Weili Nie' in Google

Zhiding Yu

This author has not been identified. Look up 'Zhiding Yu' in Google

Huaizu Jiang

This author has not been identified. Look up 'Huaizu Jiang' in Google

Chaowei Xiao

This author has not been identified. Look up 'Chaowei Xiao' in Google

Yuke Zhu

This author has not been identified. Look up 'Yuke Zhu' in Google

Song Chun Zhu

This author has not been identified. Look up 'Song Chun Zhu' in Google

Anima Anandkumar

This author has not been identified. Look up 'Anima Anandkumar' in Google