Prosodic control of unit-selection speech synthesis: A probabilistic approach

Christophe Veaux, Xavier Rodet

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

One problem in concatenative speech synthesis is how to incorporate prosodic factors in the unit selection. Imposing a predicted prosodic target is error-prone and does not benefit from the prosodic variability of the database. In this paper, we assume that several prosodic contours exist in the database for a same symbolic entry. This variability is represented by probabilistic models of the prosodic contours and the optimal sequence of units is searched by maximizing a joint likelihood at both segmental and prosodic levels. A generalized Viterbi algorithm is used to take into account the long-term dependencies introduced by the prosodic models. This method has been implemented in a unit selection synthesizer using an expressive speech database and a subjective experiment shows an improvement of the speech naturalness compared to a conventional unit-selection method.
Original languageEnglish
Title of host publicationProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic
Pages5360-5363
Number of pages4
DOIs
Publication statusPublished - 2011
Event2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Prague Congress Center, Prague, Czech Republic
Duration: 22 May 201127 May 2011

Conference

Conference2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/TerritoryCzech Republic
CityPrague
Period22/05/1127/05/11

Keywords

  • prosody
  • Speech Synthesis
  • unit selection

Fingerprint

Dive into the research topics of 'Prosodic control of unit-selection speech synthesis: A probabilistic approach'. Together they form a unique fingerprint.

Cite this