A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots

Peixin Chang, Shuijing Liu, Tianchen Ji, Neeloy Chakraborty, Kaiwen Hong, Katherine Rose Driggs-Campbell. A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots. In Jie Tan, Marc Toussaint, Kourosh Darvish, editors, Conference on Robot Learning, CoRL 2023, 6-9 November 2023, Atlanta, GA, USA. Volume 229 of Proceedings of Machine Learning Research, pages 1797-1819, PMLR, 2023. [doi]

Abstract

Abstract is missing.