TY - JOUR
T1 - Impacts of machine translation and speech synthesis on speech-to-speech translation
AU - Hashimoto, Kei
AU - Yamagishi, Junichi
AU - Byrne, William
AU - King, Simon
AU - Tokuda, Keiichi
PY - 2012
Y1 - 2012
AB - This paper analyzes the impacts of machine translation and speech synthesis on speech-to-speech translation systems. A typical speech-to-speech translation system consists of three components: speech recognition, machine translation and speech synthesis. Many techniques have been proposed for integration of speech recognition and machine translation. However, corresponding techniques have not yet been considered for speech synthesis. The focus of the current work is machine translation and speech synthesis, and we present a subjective evaluation designed to analyze their impact on speech-to-speech translation. The results of these analyses show that the naturalness and intelligibility of the synthesized speech are strongly affected by the fluency of the translated sentences. In addition, several features were found to correlate well with the average fluency of the translated sentences and the average naturalness of the synthesized speech.
U2 - 10.1016/j.specom.2012.02.004
DO - 10.1016/j.specom.2012.02.004
M3 - Article
SN - 0167-6393
VL - 54
SP - 857
EP - 866
JO - Speech Communication
JF - Speech Communication
IS - 7
ER -