Abstract
This paper proposes a method for generating speech from filterbank mel frequency cepstral coefficients (MFCC), which are widely used in speech applications, such as ASR, but are generally considered unusable for speech synthesis. First, we predict fundamental frequency and voicing information from MFCCs with an autoregressive recurrent neural network. Second, the spectral envelope information contained in MFCCs is converted to all-pole filters, and a pitch-synchronous excitation model matched to these filters is trained. Finally, we introduce a generative adversarial network-based noise model to add a realistic high-frequency stochastic component to the modeled excitation signal. The results show that high-quality speech reconstruction can be obtained, given only MFCC information at test time.
Index Terms— MFCC, Pitch prediction, Mel-filterbank inversion, Excitation modeling, Generative adversarial networks
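The second step of the abstract, converting the spectral envelope carried by MFCCs into an all-pole filter, can be illustrated with a minimal sketch. This is one common way to do it and is not necessarily the authors' procedure. Every function name and default here (`mfcc_to_allpole`, 40 mel bands, a 512-point FFT, 16 kHz sampling, filter order 20, natural-log mel energies, an orthonormal DCT-II) is an assumption made for illustration; real MFCC front ends differ in these conventions.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular mel filters, shape (n_mels, n_fft // 2 + 1)."""
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, freqs.size))
    for i in range(n_mels):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def dct_matrix(n):
    """Orthonormal DCT-II matrix; its transpose is its inverse."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    d = np.sqrt(2.0 / n) * np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    d[0] /= np.sqrt(2.0)
    return d

def levinson(r, order):
    """Levinson-Durbin recursion.

    Returns (a, err) where A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order
    and the all-pole filter is sqrt(err) / A(z).
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                      # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def mfcc_to_allpole(mfcc, n_mels=40, n_fft=512, sr=16000, order=20):
    """Approximate all-pole envelope from one MFCC frame (illustrative)."""
    # Zero-pad the truncated cepstrum to n_mels and invert the DCT.
    c = np.zeros(n_mels)
    c[:mfcc.size] = mfcc
    log_mel = dct_matrix(n_mels).T @ c
    mel_energy = np.exp(log_mel)
    # Map mel energies back to a linear-frequency power spectrum with the
    # filterbank pseudo-inverse; clamp so the spectrum stays positive.
    fb = mel_filterbank(n_mels, n_fft, sr)
    power = np.maximum(np.linalg.pinv(fb) @ mel_energy, 1e-10)
    # The inverse FFT of a power spectrum is the autocorrelation sequence.
    r = np.fft.irfft(power)[:order + 1]
    return levinson(r, order)
```

Calling `mfcc_to_allpole(frame)` on a single MFCC vector returns the denominator coefficients and the residual energy of the fitted filter. Because the cepstrum is truncated and the mel filterbank is not invertible, the recovered envelope is only a smoothed approximation of the original spectrum.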
Original language | English |
---|---|
Title of host publication | 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Subtitle of host publication | Calgary, AB, Canada |
Place of Publication | Calgary, Alberta, Canada |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 5679-5683 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-5386-4658-8 |
ISBN (Print) | 978-1-5386-4659-5 |
DOIs | |
Publication status | Published - 13 Sep 2018 |
Event | 2018 IEEE International Conference on Acoustics, Speech and Signal Processing - Calgary, Canada Duration: 15 Apr 2018 → 20 Apr 2018 https://2018.ieeeicassp.org/ https://2018.ieeeicassp.org/default.asp |
Publication series
Name | |
---|---|
Publisher | IEEE |
ISSN (Electronic) | 2379-190X |
Conference
Conference | 2018 IEEE International Conference on Acoustics, Speech and Signal Processing |
---|---|
Abbreviated title | ICASSP 2018 |
Country/Territory | Canada |
City | Calgary |
Period | 15/04/18 → 20/04/18 |
Internet address |