Utilising Spontaneous Conversational Speech in HMM-Based Speech Synthesis

Sebastian Andersson, Junichi Yamagishi, Robert Clark

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Spontaneous conversational speech has many characteristics that are currently not well modelled in unit selection and HMM-based speech synthesis. But in order to build synthetic voices more suitable for interaction we need data that exhibits more conversational characteristics than the generally used read aloud sentences. In this paper we will show how carefully selected utterances from a spontaneous conversation was instrumental for building an HMM-based synthetic voices with more natural sounding conversational characteristics than a voice based on carefully read aloud sentences. We also investigated a style blending technique as a solution to the inherent problem of phonetic coverage in spontaneous speech data. But the lack of an appropriate representation of spontaneous speech phenomena probably contributed to results showing that we could not yet compete with the speech quality achieved for grammatical sentences.
Original languageEnglish
Title of host publicationThe 7th ISCA Tutorial and Research Workshop on Speech Synthesis
Publication statusPublished - 1 Sep 2010

Fingerprint

Dive into the research topics of 'Utilising Spontaneous Conversational Speech in HMM-Based Speech Synthesis'. Together they form a unique fingerprint.

Cite this