Projects per year
Abstract
Children's speech poses challenges to speech recognition due to strong age-dependent anatomical variations and a lack of large, publicly-available corpora. In this paper we explore data augmentation for children's speech recognition using stochastic feature mapping (SFM) to transform out-of-domain adult data for both GMM-based and DNN-based acoustic models. We performed experiments on the English PF-STAR corpus, augmenting using WSJCAM0 and ABI. Our experimental results indicate that a DNN acoustic model for childrens speech can make use of adult data, and that out-of-domain SFM is more accurate than in-domain SFM.
Original language | English |
---|---|
Title of host publication | Interspeech 2016 |
Pages | 1598-1602 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 12 Sep 2016 |
Event | Interspeech 2016 - San Francisco, United States Duration: 8 Sep 2016 → 12 Sep 2016 http://www.interspeech2016.org/ |
Publication series
Name | Interspeech |
---|---|
Publisher | International Speech Communication Association |
ISSN (Print) | 1990-9772 |
Conference
Conference | Interspeech 2016 |
---|---|
Country/Territory | United States |
City | San Francisco |
Period | 8/09/16 → 12/09/16 |
Internet address |
Fingerprint
Dive into the research topics of 'Improving Children's Speech Recognition through Out-of-Domain Data Augmentation'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Multi-domain speech recognition
Non-EU industry, commerce and public corporations
1/09/15 → 28/02/19
Project: Research