This paper aims to investigate potentials existing in speech phase spectrum. We observed that the window shape and scale incompatibility error (SIE) are two important factors which deeply influence the quality of phase-only reconstructed speech. After evaluating effects of different windows, we found Chebyshev window with dynamic range of 25 to 30 dB the best option. Inspiring from Hilbert transform relations, we removed the SIE and found the reason for quality improvement of ordinary phase-only reconstructed speech by frame length extension. Results show that phase spectrum, even in very short frame lengths such as 16 ms, can be highly informative.
|Title of host publication||Proc. Interspeech 2011|
|Number of pages||4|
|Publication status||Published - 2011|
|Event||Interspeech 2011- 12th annual Conference of the International Speech Communication Association - Florence, Italy|
Duration: 27 Aug 2011 → 31 Aug 2011
|Conference||Interspeech 2011- 12th annual Conference of the International Speech Communication Association|
|Period||27/08/11 → 31/08/11|