On the Importance of Pre-emphasis and Window Shape in Phase-Based Speech Recognition

Erfan Loweimi, Seyed Mohammad Ahadi, Thomas Drugman, Samira Loveymi

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

This paper investigates the potential of the phase spectrum in automatic speech recognition (ASR). We show that the speech phase spectrum can potentially provide features with high discriminability and robustness. Motivated by this, and to exploit more of the phase spectrum's potential, we propose two simple amendments to two common blocks in feature extraction, namely pre-emphasis and windowing, without changing the workflow of the algorithms. Recognition tests on Aurora 2 indicate average performance improvements of up to 11.2% and 14.7% in the presence of both additive and convolutional noise for the phase-based MODGDF and CGDF features, respectively. This demonstrates the high potential of the phase spectrum for robust ASR.
Original language: English
Title of host publication: Advances in Nonlinear Speech Processing
Subtitle of host publication: 6th International Conference, NOLISP 2013, Mons, Belgium, June 19-21, 2013. Proceedings
Editors: Thomas Drugman, Thierry Dutoit
Place of Publication: Berlin, Heidelberg
Publisher: Springer Berlin Heidelberg
Pages: 160-167
Number of pages: 8
ISBN (Print): 978-3-642-38847-7
Publication status: Published - 2013
Event: NOLISP 2013: Non-Linear Speech Processing - Mons, Belgium
Duration: 19 Jun 2013 to 21 Jun 2013
http://www.tcts.fpms.ac.be/nolisp2013/index.php

Publication series

Name: Lecture Notes in Computer Science
Volume: 7911
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: NOLISP 2013
Abbreviated title: NOLISP 2013
Country: Belgium
City: Mons
Period: 19/06/13 to 21/06/13
