Abstract / Description of output
We explore the use of linguistic features for text to speech (TTS) conversion in the context of a speech-to-speech translation system that can be extracted from unannotated text in an unsupervised, language-independent fashion. The features are intended to act as surrogates for conventional part of speech (POS) features. Unlike POS features, the experimental features assume only the availability of tools and data that must already be in place for the construction of other components of the translation system, and can therefore be used for the TTS module without incurring additional TTS-specific costs. We here describe the use of the experimental features in a speech synthesiser, using six different configurations of the system to allow the comparison of the proposed features with conventional, knowledge-based POS features. We present results of objective and subjective evaluations of the usefulness of the new features.
Original language | English |
---|---|
Title of host publication | INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011 |
Publisher | ISCA |
Pages | 2153-2156 |
Number of pages | 4 |
Publication status | Published - 2011 |
Event | Interspeech 2011- 12th annual Conference of the International Speech Communication Association - Florence, Italy Duration: 27 Aug 2011 → 31 Aug 2011 |
Conference
Conference | Interspeech 2011- 12th annual Conference of the International Speech Communication Association |
---|---|
Country/Territory | Italy |
City | Florence |
Period | 27/08/11 → 31/08/11 |