Francisco Rivera Valverde, Juana Valeria Hurtado, Abhinav Valada. There Is More Than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking With Sound by Distilling Multimodal Knowledge. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 11612-11621, Computer Vision Foundation / IEEE, 2021. [doi]
Abstract is missing.