Transformer vision-language tracking via proxy token guided cross-modal fusion

Haojie Zhao, Xiao Wang 0014, Dong Wang 0004, Huchuan Lu, Xiang Ruan. Transformer vision-language tracking via proxy token guided cross-modal fusion. Pattern Recognition Letters, 168:10-16, April 2023.

Authors

Haojie Zhao

Xiao Wang 0014

Dong Wang 0004

Huchuan Lu

Xiang Ruan
