An experimental comparison of multiple vocoder types

Qiong Hu, Korin Richmond, Junichi Yamagishi, Javier Latorre

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents an experimental comparison of a broad range of the leading vocoder types which have been previously described. We use a reference implementation of each of these to create stimuli for a listening test using copy synthesis. The listening test is performed using both Lombard and normal read speech stimuli, and with two types of question for comparison. Multi-dimensional Scaling (MDS) is conducted on the listener responses to analyse similarities in terms of quality between the vocoders. Our MDS and clustering results show that the vocoders which use a sinusoidal synthesis approach are perceptually distinguishable from the source-filter vocoders. To help further interpret the axes of the resulting MDS space, we test for correlations with standard acoustic quality metrics and find one axis is strongly correlated with PESQ scores. We also find both speech style and the format of the listening test question may influence test results. Finally, we also present preference test results which compare each vocoder with the natural speech.
Original languageEnglish
Title of host publication8th ISCA Workshop on Speech Synthesis
Subtitle of host publicationBarcelona, Spain
Pages155-160
Number of pages6
Publication statusPublished - Aug 2013

Fingerprint

Dive into the research topics of 'An experimental comparison of multiple vocoder types'. Together they form a unique fingerprint.

Cite this