Estimating the Spectral Envelope of Voiced Speech Using Multi-frame Analysis

Yoshinori Shiga, Simon King

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


This paper proposes a novel approach for estimating the spectral envelope of voiced speech independently of its harmonic structure. Because voiced speech is quasi-periodic, its spectrum exhibits harmonic structure and has energy only at frequencies corresponding to integer multiples of F0. It is hence impossible to identify the transfer characteristics between adjacent harmonics. To resolve this problem, Multi-frame Analysis (MFA) is introduced. MFA estimates a spectral envelope using many portions of speech that are vocalised with the same vocal-tract shape. Since each portion usually has a different F0, and hence a different harmonic structure, harmonics can be obtained at various frequencies to form a spectral envelope. The method thereby gives a closer approximation to the vocal-tract transfer function.
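The sampling argument in the abstract can be illustrated with a small numerical sketch. This is not the authors' MFA algorithm. It only shows why pooling harmonics from frames with different F0 values samples the envelope more densely than a single frame does. The envelope shape, the F0 values, the 4 kHz band limit, and all function names below are hypothetical choices made for illustration. The sketch also assumes every frame shares exactly the same envelope and gain, which real speech frames would not.

```python
import numpy as np


def envelope_db(freq_hz):
    """Hypothetical smooth log-magnitude envelope with two formant-like bumps."""
    f = np.asarray(freq_hz, dtype=float)
    return (20.0 * np.exp(-0.5 * ((f - 700.0) / 150.0) ** 2)
            + 15.0 * np.exp(-0.5 * ((f - 1800.0) / 200.0) ** 2))


def harmonic_samples(f0_hz, max_hz=4000.0):
    """Sample the envelope only at integer multiples of F0, as one voiced frame would."""
    n_harmonics = int(np.floor(max_hz / f0_hz))
    freqs = f0_hz * np.arange(1, n_harmonics + 1)
    return freqs, envelope_db(freqs)


def pool_frames(f0_values, max_hz=4000.0):
    """Merge harmonic samples from several frames assumed to share one envelope."""
    freqs = np.concatenate([harmonic_samples(f0, max_hz)[0] for f0 in f0_values])
    freqs = np.sort(freqs)  # exact duplicates across frames are kept
    return freqs, envelope_db(freqs)


def max_gap(freqs):
    """Largest distance in Hz between adjacent sampled frequencies."""
    return float(np.max(np.diff(freqs)))


if __name__ == "__main__":
    single_f, _ = harmonic_samples(200.0)
    pooled_f, _ = pool_frames([170.0, 185.0, 200.0, 215.0, 230.0])
    print("single frame:", len(single_f), "samples, max gap", max_gap(single_f), "Hz")
    print("pooled frames:", len(pooled_f), "samples, max gap", max_gap(pooled_f), "Hz")
```

With a single frame at F0 = 200 Hz, nothing is observed between adjacent harmonics, so the largest unobserved stretch is 200 Hz wide. Pooling the five frames places samples inside every one of those stretches, which is the property the abstract relies on when it says harmonics "can be obtained at various frequencies".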
Original language: English
Title of host publication: Eurospeech 2003 - Interspeech 2003
Subtitle of host publication: 8th European Conference on Speech Communication and Technology
Publisher: International Speech Communication Association
Number of pages: 4
ISBN (Print): ISSN 1990-9772
Publication status: Published - 1 Sep 2003

