Projects per year
Abstract
In this paper we introduce a new cepstral coefficient extraction method based on an intelligibility measure for speech in noise, the Glimpse Proportion measure. This new method aims to increase the intelligibility of speech in noise by modifying the clean speech, and has applications in scenarios such as public announcement and car navigation systems. We first explain how the Glimpse Proportion measure operates and further show how we approximated it to integrate it into an existing spectral envelope parameter extraction method commonly used in the HMM-based speech synthesis framework. We then demonstrate how this new method changes the modelled spectrum according to the characteristics of the noise and show results for a listening test with vocoded and HMM-based synthetic speech. The test indicates that the proposed method can significantly improve intelligibility of synthetic speech in speech shaped noise.
| Original language | English |
|---|---|
| Title of host publication | 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
| Subtitle of host publication | Kyoto, Japan |
| Place of Publication | NEW YORK |
| Publisher | Institute of Electrical and Electronics Engineers |
| Pages | 3997-4000 |
| Number of pages | 4 |
| ISBN (Electronic) | 978-1-4673-0044-5 |
| ISBN (Print) | 978-1-4673-0045-2 |
| DOIs | |
| Publication status | Published - 2012 |
| Event | IEEE International Conference on Acoustics, Speech and Signal Processing - Kyoto Duration: 25 Mar 2012 → 30 Mar 2012 |
Publication series
| Name | IEEE International Conference on Acoustics Speech and Signal Processing |
|---|---|
| Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
| ISSN (Print) | 1520-6149 |
Conference
| Conference | IEEE International Conference on Acoustics, Speech and Signal Processing |
|---|---|
| City | Kyoto |
| Period | 25/03/12 → 30/03/12 |
Keywords / Materials (for Non-textual outputs)
- approximation theory
- hidden Markov models
- speech coding
- speech synthesis
- HMM-based speech synthesis framework
- HMM-based synthetic speech intelligibility
- car navigation systems
- cepstral coefficient extraction method
- clean speech modification
- glimpse proportion measure
- speech shaped noise
- vocoded synthetic speech
- Accuracy
- Approximation methods
- Cepstral analysis
- Hidden Markov models
- Noise
- Noise measurement
- Speech
- HMM-based speech synthesis
- Lombard speech
- cepstral coefficient extraction
- objective measure for speech intelligibility
Fingerprint
Dive into the research topics of 'Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Deep architectures for statistical speech synthesis
Yamagishi, J. (Principal Investigator)
UK industry, commerce and public corporations
4/09/12 → 3/03/16
Project: Research
-
LISTA: LISTA- The Listening Talker (RTGS)
King, S. (Principal Investigator), Mayo, C. (Co-investigator) & Renals, S. (Co-investigator)
1/05/10 → 30/04/13
Project: Research
Activities
- 1 Invited talk
-
EACL 2014 keynote: Speech synthesis needs YOU!
King, S. (Speaker)
29 Apr 2014Activity: Academic talk or presentation types › Invited talk
File