Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss

Tengyu Deng, Eita Nakamura, Kazuyoshi Yoshii. Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023, Taipei, Taiwan, October 31 - Nov. 3, 2023. pages 583-590, IEEE, 2023. [doi]

@inproceedings{DengNY23,
  title = {Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss},
  author = {Tengyu Deng and Eita Nakamura and Kazuyoshi Yoshii},
  year = {2023},
  doi = {10.1109/APSIPAASC58517.2023.10317419},
  url = {https://doi.org/10.1109/APSIPAASC58517.2023.10317419},
  researchr = {https://researchr.org/publication/DengNY23},
  cites = {0},
  citedby = {0},
  pages = {583-590},
  booktitle = {Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023, Taipei, Taiwan, October 31 - Nov. 3, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-0067-3},
}