Subjective Evaluation Of Join Cost Functions Used In Unit Selection Speech Synthesis

Jithendra Vepa, Simon King

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In our previous papers, we have proposed join cost functions derived from spectral distances, which have good correlations with perceptual scores obtained for a range of concatenation discontinuities. To further validate their ability to predict concatenation discontinuities, we have chosen the best three spectral distances and evaluated them subjectively in a listening test. The unit sequences for synthesis stimuli are obtained from a state-of-the-art unit selection text-to-speech system: rVoice from Rhetorical Systems Ltd. In this paper, we report listeners' preferences for each of the three join cost functions.
Original language: English
Title of host publication: Interspeech 2004 - ICSLP
Subtitle of host publication: 8th International Conference on Spoken Language Processing
Publisher: International Speech Communication Association
Pages: 1181-1184
Number of pages: 4
ISSN: 1990-9772
Publication status: Published - 1 Oct 2004
