Edinburgh Research Explorer

HMM-based speech synthesiser using the LF-model of the glottal source

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Original language: English
Title of host publication: Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Pages: 4704-4707
Number of pages: 4
ISBN (Electronic): 978-1-4577-0537-3
Publication status: Published - 1 May 2011
Event: ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Prague, Czech Republic
Duration: 22 May 2011 to 27 May 2011

Conference

Conference: ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country: Czech Republic
Period: 22/05/11 to 27/05/11

Abstract

A major factor that degrades speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech. This paper sets out a new approach that uses an acoustic glottal source model in HMM-based synthesisers instead of the traditional pulse signal. The goal is to improve speech quality and to better model and transform voice characteristics. We have found that the new method decreases buzziness and also improves prosodic modelling. A perceptual evaluation has supported this finding by showing a 55.6% preference for the new system over the baseline. This improvement, while not as significant as we had initially expected, does encourage us to develop the proposed speech synthesiser further.


ID: 4930085