Improving Children's Speech Recognition through Out-of-Domain Data Augmentation

Joachim Fainberg, Peter Bell, Mike Lincoln, Steve Renals

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Children's speech poses challenges to speech recognition due to strong age-dependent anatomical variations and a lack of large, publicly-available corpora. In this paper we explore data augmentation for children's speech recognition using stochastic feature mapping (SFM) to transform out-of-domain adult data for both GMM-based and DNN-based acoustic models. We performed experiments on the English PF-STAR corpus, augmenting using WSJCAM0 and ABI. Our experimental results indicate that a DNN acoustic model for childrens speech can make use of adult data, and that out-of-domain SFM is more accurate than in-domain SFM.
Original languageEnglish
Title of host publicationInterspeech 2016
Pages1598-1602
Number of pages5
DOIs
Publication statusPublished - 12 Sep 2016
EventInterspeech 2016 - San Francisco, United States
Duration: 8 Sep 201612 Sep 2016
http://www.interspeech2016.org/

Publication series

NameInterspeech
PublisherInternational Speech Communication Association
ISSN (Print)1990-9772

Conference

ConferenceInterspeech 2016
Country/TerritoryUnited States
CitySan Francisco
Period8/09/1612/09/16
Internet address

Fingerprint

Dive into the research topics of 'Improving Children's Speech Recognition through Out-of-Domain Data Augmentation'. Together they form a unique fingerprint.

Cite this