Multisyn: Open-domain unit selection for the Festival speech synthesis system

Robert A. J. Clark, Korin Richmond, Simon King

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

We present the implementation and evaluation of an open-domain unit selection speech synthesis engine designed to be flexible enough to encourage further unit selection research and allow rapid voice development by users with minimal speech synthesis knowledge and experience. We address the issues of automatically processing speech data into a usable voice using automatic segmentation techniques and how the knowledge obtained at labelling time can be exploited at synthesis time. We describe target cost and join cost implementation for such a system and describe the outcome of building voices with a number of different sized datasets. We show that, in a competitive evaluation, voices built using this technology compare favourably to other systems.
Original languageEnglish
Pages (from-to)317-330
Number of pages14
JournalSpeech Communication
Volume49
Issue number4
Early online date15 Feb 2007
DOIs
Publication statusPublished - Apr 2007

Keywords / Materials (for Non-textual outputs)

  • Speech synthesis
  • Unit selection

Fingerprint

Dive into the research topics of 'Multisyn: Open-domain unit selection for the Festival speech synthesis system'. Together they form a unique fingerprint.

Cite this