Speaker similarity evaluation of foreign-accented speech synthesis using HMM-based speaker adaptation

M. Wester, R. Karhila

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution

Abstract

This paper describes a speaker discrimination experiment in which native English listeners were presented with natural and synthetic speech stimuli in English and were asked to judge whether they thought the sentences were spoken by the same person or not. The natural speech consisted of recordings of Finnish speakers speaking English. The synthetic stimuli were created using adaptation data from the same Finnish speakers. Two average voice models were compared: one trained on Finnish-accented English and the other on American-accented English. The experiments illustrate that listeners perform well at speaker discrimination when the stimuli are both natural or both synthetic, but when the speech types are crossed performance drops significantly. We also found that the type of accent in the average voice model had no effect on the listeners' speaker discrimination performance.
Original language: English
Title of host publication: Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Pages: 5372-5375
Number of pages: 4
ISBN (Electronic): 978-1-4577-0537-3
Publication status: Published - 1 May 2011
