Accurate spectral envelope estimation for articulation-to-speech synthesis

Yoshinori Shiga, Simon King

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper introduces a novel articulatory-acoustic mapping in which detailed spectral envelopes are estimated based on the cepstrum, inclusive of the high-quefrency elements which are discarded in conventional speech synthesis to eliminate the pitch component of speech. For this estimation, the method deals with the harmonics of multiple voiced-speech spectra so that several sets of harmonics can be obtained at various pitch frequencies to form a spectral envelope. The experimental result shows that the method estimates spectral envelopes with the highest accuracy when the cepstral order is 48--64, which suggests that the higher order coeffcients are required to represent detailed envelopes reflecting the real vocal-tract responses.
Original languageEnglish
Title of host publicationProc. 5th ISCA Speech Synthesis Workshop
PublisherInternational Speech Communication Association
Pages19-24
Number of pages6
Publication statusPublished - 1 Jun 2004

Fingerprint

Dive into the research topics of 'Accurate spectral envelope estimation for articulation-to-speech synthesis'. Together they form a unique fingerprint.

Cite this