A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots

Peixin Chang, Shuijing Liu, Tianchen Ji, Neeloy Chakraborty, Kaiwen Hong, Katherine Rose Driggs-Campbell. A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots. In Jie Tan, Marc Toussaint, Kourosh Darvish, editors, Conference on Robot Learning, CoRL 2023, 6-9 November 2023, Atlanta, GA, USA. Volume 229 of Proceedings of Machine Learning Research, pages 1797-1819, PMLR, 2023.

@inproceedings{ChangLJCHD23,
  title = {A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots},
  author = {Peixin Chang and Shuijing Liu and Tianchen Ji and Neeloy Chakraborty and Kaiwen Hong and Katherine Rose Driggs-Campbell},
  year = {2023},
  url = {https://proceedings.mlr.press/v229/chang23a.html},
  researchr = {https://researchr.org/publication/ChangLJCHD23},
  cites = {0},
  citedby = {0},
  pages = {1797--1819},
  booktitle = {Conference on Robot Learning, CoRL 2023, 6-9 November 2023, Atlanta, GA, USA},
  editor = {Jie Tan and Marc Toussaint and Kourosh Darvish},
  volume = {229},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}