Abstract / Description of output
In this paper we present a novel feature extraction algorithm based on group delay function for robust speech recognition. The modified group delay function (MODGDF) is the main feature extraction method based on group delay function, generally used for robust speech recognition. The recognition tests indicate this feature does not provide notably better results in the presence of additive noise in comparison with MFCC. In the presence of convolutional noise, the performance of MODGDF is considerably lower than MFCC. The method proposed in this paper is simple and makes more efficient utilization of the high resolution property of GDF. It is formed from three main parts which are signal modeling, GDF computation based on extracted model, and compression. The recognition results obtained over AURORA 2.0 task indicate its superior performance in comparison with MODGDF and MFCC.
Original language | English |
---|---|
Title of host publication | 2011 IEEE International Conference on Multimedia and Expo |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 1-5 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-61284-349-0 |
ISBN (Print) | 978-1-61284-348-3 |
DOIs | |
Publication status | Published - 1 Jul 2011 |
Event | 2011 IEEE International Conference on Multimedia and Expo - Barcelona, Spain Duration: 11 Jul 2011 → 15 Jul 2011 http://www.ieee-icme.org/icme2011/ |
Conference
Conference | 2011 IEEE International Conference on Multimedia and Expo |
---|---|
Abbreviated title | ICME 2011 |
Country/Territory | Spain |
City | Barcelona |
Period | 11/07/11 → 15/07/11 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- Delay
- Speech
- Mel frequency cepstral coefficient
- Speech recognition
- Feature extraction
- Computational modeling
- Discrete cosine transforms
- Robust speech recognition
- group delay function
- signal modeling
- compression