BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning

Xiao Xu 0005, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan. BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning. In Brian Williams 0001, Yiling Chen 0001, Jennifer Neville, editors, Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023. pages 10637-10647, AAAI Press, 2023. [doi]

@inproceedings{0005WRLCD23,
  title = {BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning},
  author = {Xiao Xu 0005 and Chenfei Wu and Shachar Rosenman and Vasudev Lal and Wanxiang Che and Nan Duan},
  year = {2023},
  url = {https://ojs.aaai.org/index.php/AAAI/article/view/26263},
  researchr = {https://researchr.org/publication/0005WRLCD23},
  cites = {0},
  citedby = {0},
  pages = {10637-10647},
  booktitle = {Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023},
  editor = {Brian Williams 0001 and Yiling Chen 0001 and Jennifer Neville},
  publisher = {AAAI Press},
  isbn = {978-1-57735-880-0},
}