PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation

Yuan Gong, Yu-An Chung, James R. Glass. PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation. IEEE Transactions on Audio, Speech & Language Processing, 29:3292-3306, 2021. https://doi.org/10.1109/TASLP.2021.3120633

@article{GongCG21,
  title = {PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation},
  author = {Yuan Gong and Yu-An Chung and James R. Glass},
  year = {2021},
  doi = {10.1109/TASLP.2021.3120633},
  url = {https://doi.org/10.1109/TASLP.2021.3120633},
  researchr = {https://researchr.org/publication/GongCG21},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Audio, Speech \& Language Processing},
  volume = {29},
  pages = {3292--3306},
}