A Unified Multi-modal Structure for Retrieving Tracked Vehicles through Natural Language Descriptions

Dong Xie, Linhu Liu, Shengjun Zhang, Jiang Tian. A Unified Multi-modal Structure for Retrieving Tracked Vehicles through Natural Language Descriptions. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Workshops, Vancouver, BC, Canada, June 17-24, 2023. pages 5419-5427, IEEE, 2023. [doi]

Authors

Dong Xie

This author has not been identified. Look up 'Dong Xie' in Google

Linhu Liu

This author has not been identified. Look up 'Linhu Liu' in Google

Shengjun Zhang

This author has not been identified. Look up 'Shengjun Zhang' in Google

Jiang Tian

This author has not been identified. Look up 'Jiang Tian' in Google