Unsupervised Features from Text for Speech Synthesis in a Speech-to-Speech Translation System

Oliver Watts, Bowen Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

We explore the use of linguistic features for text to speech (TTS) conversion in the context of a speech-to-speech translation system that can be extracted from unannotated text in an unsupervised, language-independent fashion. The features are intended to act as surrogates for conventional part of speech (POS) features. Unlike POS features, the experimental features assume only the availability of tools and data that must already be in place for the construction of other components of the translation system, and can therefore be used for the TTS module without incurring additional TTS-specific costs. We here describe the use of the experimental features in a speech synthesiser, using six different configurations of the system to allow the comparison of the proposed features with conventional, knowledge-based POS features. We present results of objective and subjective evaluations of the usefulness of the new features.
Original languageEnglish
Title of host publicationINTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011
PublisherISCA
Pages2153-2156
Number of pages4
Publication statusPublished - 2011
EventInterspeech 2011- 12th annual Conference of the International Speech Communication Association - Florence, Italy
Duration: 27 Aug 201131 Aug 2011

Conference

ConferenceInterspeech 2011- 12th annual Conference of the International Speech Communication Association
Country/TerritoryItaly
CityFlorence
Period27/08/1131/08/11

Fingerprint

Dive into the research topics of 'Unsupervised Features from Text for Speech Synthesis in a Speech-to-Speech Translation System'. Together they form a unique fingerprint.

Cite this