Improving Speech Translation Accuracy and Time Efficiency With Fine-Tuned wav2vec 2.0-Based Speech Segmentation

Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura 0001. Improving Speech Translation Accuracy and Time Efficiency With Fine-Tuned wav2vec 2.0-Based Speech Segmentation. IEEE Transactions on Audio, Speech & Language Processing, 32:906-916, 2024. [doi]

@article{FukudaSN24,
  title = {Improving Speech Translation Accuracy and Time Efficiency With Fine-Tuned wav2vec 2.0-Based Speech Segmentation},
  author = {Ryo Fukuda and Katsuhito Sudoh and Satoshi Nakamura 0001},
  year = {2024},
  doi = {10.1109/TASLP.2023.3343614},
  url = {https://doi.org/10.1109/TASLP.2023.3343614},
  researchr = {https://researchr.org/publication/FukudaSN24},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Audio, Speech & Language Processing},
  volume = {32},
  pages = {906-916},
}