RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation

Zeyuan Yang, Jiageng Lin, Peihao Chen, Anoop Cherian, Tim K. Marks, Jonathan Le Roux, Chuang Gan. RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 16251-16261, IEEE, 2024. [doi]

@inproceedings{YangLCCMRG24,
  title = {RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation},
  author = {Zeyuan Yang and Jiageng Lin and Peihao Chen and Anoop Cherian and Tim K. Marks and Jonathan Le Roux and Chuang Gan},
  year = {2024},
  doi = {10.1109/CVPR52733.2024.01538},
  url = {https://doi.org/10.1109/CVPR52733.2024.01538},
  researchr = {https://researchr.org/publication/YangLCCMRG24},
  cites = {0},
  citedby = {0},
  pages = {16251-16261},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024},
  publisher = {IEEE},
  isbn = {979-8-3503-5300-6},
}