Abstract
Acoustic models used for statistical parametric speech synthesis typically incorporate many modelling assumptions. It is an open question to what extent these assumptions limit the naturalness of synthesised speech. To investigate this question, we recorded a speech corpus where each prompt was read aloud multiple times. By combining speech parameter trajectories extracted from different repetitions, we were able to quantify the perceptual effects of certain commonly used modelling assumptions. Subjective listening tests show that taking the source and filter parameters to be conditionally independent, or using diagonal covariance matrices, significantly limits the naturalness that can be achieved. Our experimental results also demonstrate the shortcomings of mean-based parameter generation.
Original language | English |
---|---|
Title of host publication | INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association |
Publisher | International Speech Communication Association |
Pages | 1504-1508 |
Number of pages | 5 |
Publication status | Published - 2014 |
Publication: 'Measuring the Perceptual Effects of Modelling Assumptions in Speech Synthesis Using Stimuli Constructed from Repeated Natural Speech'
Projects
2 Finished
- INSPIRE - Investigating speech processing in realistic environments
  1/01/12 → 31/12/15
  Project: Research
- Natural Speech Technology
  Renals, S. (Principal Investigator) & King, S. (Co-investigator)
  1/05/11 → 31/07/16
  Project: Research