Preserving speaker information in direct Speech-to-Speech Translation with non-autoregressive generation and pre-training

Rui Zhou, Akinori Ito, Takashi Nose. Preserving speaker information in direct Speech-to-Speech Translation with non-autoregressive generation and pre-training. Computer Speech & Language, 97:101902, 2026.
