Face, Body, Voice: Video Person-Clustering with Multiple Modalities

Andrew Brown 0006, Vicky Kalogeiton, Andrew Zisserman. Face, Body, Voice: Video Person-Clustering with Multiple Modalities. In IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021, Montreal, BC, Canada, October 11-17, 2021. pages 3177-3187, IEEE, 2021. [doi]

Abstract

Abstract is missing.