Projects per year
Abstract
Children's speech poses challenges to speech recognition due to strong age-dependent anatomical variations and a lack of large, publicly-available corpora. In this paper we explore data augmentation for children's speech recognition using stochastic feature mapping (SFM) to transform out-of-domain adult data for both GMM-based and DNN-based acoustic models. We performed experiments on the English PF-STAR corpus, augmenting using WSJCAM0 and ABI. Our experimental results indicate that a DNN acoustic model for childrens speech can make use of adult data, and that out-of-domain SFM is more accurate than in-domain SFM.
| Original language | English |
|---|---|
| Title of host publication | Interspeech 2016 |
| Pages | 1598-1602 |
| Number of pages | 5 |
| DOIs | |
| Publication status | Published - 12 Sept 2016 |
| Event | Interspeech 2016 - San Francisco, United States Duration: 8 Sept 2016 → 12 Sept 2016 http://www.interspeech2016.org/ |
Publication series
| Name | Interspeech |
|---|---|
| Publisher | International Speech Communication Association |
| ISSN (Print) | 1990-9772 |
Conference
| Conference | Interspeech 2016 |
|---|---|
| Country/Territory | United States |
| City | San Francisco |
| Period | 8/09/16 → 12/09/16 |
| Internet address |
Fingerprint
Dive into the research topics of 'Improving Children's Speech Recognition through Out-of-Domain Data Augmentation'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Multi-domain speech recognition
Renals, S. (Principal Investigator)
Non-EU industry, commerce and public corporations
1/09/15 → 28/02/19
Project: Research