An HMM-based speech synthesiser using Glottal Post-Filtering

João P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract / Description of output

Control over voice quality, e.g. breathy and tense voice, is important for speech synthesis applications. For example, transformations can be used to modify aspects of the voice related to the speaker's identity and to improve expressiveness. However, it is hard to modify the voice characteristics of synthetic speech without degrading speech quality. State-of-the-art statistical speech synthesisers, in particular, do not typically allow control over parameters of the glottal source, which are strongly correlated with voice quality. Consequently, the control of voice characteristics in these systems is limited. In contrast, the HMM-based speech synthesiser proposed in this paper uses an acoustic glottal source model. The system passes the glottal signal through a whitening filter to obtain the excitation of voiced sounds. This technique, called glottal post-filtering, allows the voice characteristics of the synthetic speech to be transformed by modifying the source model parameters. We evaluated the proposed synthesiser in a perceptual experiment, in terms of speech naturalness, intelligibility, and similarity to the original speaker's voice. The results show that it performed as well as an HMM-based synthesiser that generates the speech signal with a commonly used high-quality speech vocoder.
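
The whitening step mentioned in the abstract can be illustrated with a small sketch. It is not the paper's implementation: the abstract does not say which glottal model or whitening filter is used, so the code below assumes a toy Rosenberg-style pulse as the glottal signal and LPC inverse filtering (autocorrelation method) as the whitening filter. All function names and parameter values are hypothetical.

```python
import numpy as np


def rosenberg_pulse(n_open, n_close, n_total):
    """Toy glottal flow pulse (assumed stand-in for a real glottal source model)."""
    pulse = np.zeros(n_total)
    t_open = np.arange(n_open)
    # Opening phase: raised-cosine rise from 0 towards 1.
    pulse[:n_open] = 0.5 * (1.0 - np.cos(np.pi * t_open / n_open))
    t_close = np.arange(n_close)
    # Closing phase: quarter-cosine fall from 1 towards 0; the rest stays 0.
    pulse[n_open:n_open + n_close] = np.cos(0.5 * np.pi * t_close / n_close)
    return pulse


def lpc_coefficients(signal, order):
    """Autocorrelation-method LPC; returns [1, a1, ..., ap] of the inverse filter A(z)."""
    n = len(signal)
    # Autocorrelation at lags 0..order.
    r = np.correlate(signal, signal, mode="full")[n - 1:n + order]
    # Toeplitz normal equations, lightly regularised for numerical stability.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)
    a = np.linalg.solve(R, -r[1:order + 1])
    return np.concatenate(([1.0], a))


def whiten(signal, order=10):
    """Flatten the spectrum of `signal` by filtering it with its own LPC inverse filter."""
    a = lpc_coefficients(signal, order)
    return np.convolve(signal, a)[:len(signal)]


def spectral_flatness(signal):
    """Geometric mean over arithmetic mean of the power spectrum (1.0 means flat)."""
    power = np.abs(np.fft.rfft(signal)) ** 2 + 1e-12
    return np.exp(np.mean(np.log(power))) / np.mean(power)


if __name__ == "__main__":
    pulse = rosenberg_pulse(n_open=60, n_close=30, n_total=160)
    excitation = whiten(pulse, order=10)
    print("flatness before:", spectral_flatness(pulse))
    print("flatness after: ", spectral_flatness(excitation))
```

In this sketch, changing the shape parameters of the pulse (here `n_open` and `n_close`) changes the signal fed to the whitening filter, which is loosely analogous to modifying source model parameters to alter voice characteristics.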

Original language: English
Title of host publication: Proceedings of The Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, Kyoto, Japan, September 22-24, 2010
Editors: Yoshinori Sagisaka, Keiichi Tokuda
Publisher: ISCA
Pages: 365-370
Number of pages: 6
Publication status: Published - 24 Sept 2010
Event: 7th ISCA Tutorial and Research Workshop on Speech Synthesis, SSW 2010 - Kyoto, Japan
Duration: 22 Sept 2010 to 24 Sept 2010

Publication series

Name: The Seventh ISCA Tutorial and Research Workshop on Speech Synthesis
Publisher: ISCA
ISSN (Electronic): 1680-8908

Conference

Conference: 7th ISCA Tutorial and Research Workshop on Speech Synthesis, SSW 2010
Country/Territory: Japan
City: Kyoto
Period: 22/09/10 to 24/09/10

Keywords / Materials (for Non-textual outputs)

  • glottal post-filter
  • HMM-based speech synthesis
  • voice quality
