Transformer vision-language tracking via proxy token guided cross-modal fusion

Haojie Zhao, Xiao Wang, Dong Wang, Huchuan Lu, Xiang Ruan. Transformer vision-language tracking via proxy token guided cross-modal fusion. Pattern Recognition Letters, 168:10-16, April 2023. doi:10.1016/j.patrec.2023.02.023

@article{ZhaoWWLR23,
  title = {Transformer vision-language tracking via proxy token guided cross-modal fusion},
  author = {Haojie Zhao and Xiao Wang and Dong Wang and Huchuan Lu and Xiang Ruan},
  year = {2023},
  month = {April},
  doi = {10.1016/j.patrec.2023.02.023},
  url = {https://doi.org/10.1016/j.patrec.2023.02.023},
  researchr = {https://researchr.org/publication/ZhaoWWLR23},
  cites = {0},
  citedby = {0},
  journal = {Pattern Recognition Letters},
  volume = {168},
  pages = {10--16},
}