A Cross-Modal Object-Aware Transformer for Vision-and-Language Navigation

Han Ni, Jia Chen, Dayong Zhu, Dianxi Shi. A Cross-Modal Object-Aware Transformer for Vision-and-Language Navigation. In Marek Z. Reformat, Du Zhang, Nikolaos G. Bourbakis, editors, 34th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2022, Macao, China, October 31 - November 2, 2022, pages 976-981. IEEE, 2022. doi:10.1109/ICTAI56018.2022.00149

@inproceedings{NiCZS22,
  title = {A Cross-Modal Object-Aware Transformer for Vision-and-Language Navigation},
  author = {Han Ni and Jia Chen and Dayong Zhu and Dianxi Shi},
  year = {2022},
  doi = {10.1109/ICTAI56018.2022.00149},
  url = {https://doi.org/10.1109/ICTAI56018.2022.00149},
  researchr = {https://researchr.org/publication/NiCZS22},
  pages = {976--981},
  booktitle = {34th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2022, Macao, China, October 31 - November 2, 2022},
  editor = {Marek Z. Reformat and Du Zhang and Nikolaos G. Bourbakis},
  publisher = {IEEE},
  isbn = {979-8-3503-9744-4},
}