ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning

Sangho Lee, Jiwan Chung, Youngjae Yu, Gunhee Kim, Thomas M. Breuel, Gal Chechik, Yale Song. ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021, pages 10254-10264. IEEE, 2021. doi:10.1109/ICCV48922.2021.01011

@inproceedings{LeeCYKBCS21,
  title = {{ACAV100M}: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning},
  author = {Sangho Lee and Jiwan Chung and Youngjae Yu and Gunhee Kim and Thomas M. Breuel and Gal Chechik and Yale Song},
  year = {2021},
  doi = {10.1109/ICCV48922.2021.01011},
  url = {https://doi.org/10.1109/ICCV48922.2021.01011},
  researchr = {https://researchr.org/publication/LeeCYKBCS21},
  cites = {0},
  citedby = {0},
  pages = {10254--10264},
  booktitle = {2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-2812-5},
}