Abstract / Description of output
This paper aims at investigating the potentials of the phase spectrum in automatic speech recognition (ASR). We show that speech phase spectrum could potentially provide features with high discriminability and robustness. Out of such belief and to realize a higher portion of the phase spectrum potentials, we propose two simple amendments in two common blocks in feature extraction, namely pre-emphasis and windowing, without changing the workflow of the algorithms. Recognition tests over Aurora 2 indicate up to 11.2% and 14.7% performance improvement in average in the presence of both additive and convolutional noises for phase-based MODGDF and CGDF features, respectively. It proves the high potentials of the phase spectrum in robust ASR.
Original language | English |
---|---|
Title of host publication | Advances in Nonlinear Speech Processing |
Subtitle of host publication | 6th International Conference, NOLISP 2013, Mons, Belgium, June 19-21, 2013. Proceedings |
Editors | Thomas Drugman, Thierry Dutoit |
Place of Publication | Berlin, Heidelberg |
Publisher | Springer |
Pages | 160-167 |
Number of pages | 8 |
ISBN (Print) | 978-3-642-38847-7 |
DOIs | |
Publication status | Published - 2013 |
Event | NOLISP 2013: Non-Linear Speech Processing - Mons, Belgium Duration: 19 Jun 2013 → 21 Jun 2013 http://www.tcts.fpms.ac.be/nolisp2013/index.php |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 7911 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | NOLISP 2013 |
---|---|
Abbreviated title | NOLISP 2013 |
Country/Territory | Belgium |
City | Mons |
Period | 19/06/13 → 21/06/13 |
Internet address |