STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

Chen Chen, Bowen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Madappally Jose, Alexander Toshev, Yantao Zheng, Jonathon Shlens, Ruoming Pang, Yinfei Yang. STAIR: Learning Sparse Text and Image Representation in Grounded Tokens. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023. pages 15079-15094, Association for Computational Linguistics, 2023. [doi]

@inproceedings{ChenZCSGJTZSPY23,
  title = {STAIR: Learning Sparse Text and Image Representation in Grounded Tokens},
  author = {Chen Chen and Bowen Zhang and Liangliang Cao and Jiguang Shen and Tom Gunter and Albin Madappally Jose and Alexander Toshev and Yantao Zheng and Jonathon Shlens and Ruoming Pang and Yinfei Yang},
  year = {2023},
  url = {https://aclanthology.org/2023.emnlp-main.932},
  researchr = {https://researchr.org/publication/ChenZCSGJTZSPY23},
  cites = {0},
  citedby = {0},
  pages = {15079-15094},
  booktitle = {Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023},
  editor = {Houda Bouamor and Juan Pino 0001 and Kalika Bali},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-060-8},
}