Learning pronunciation from other accents via pronunciation knowledge transfer

Siqi Sun, Korin Richmond

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Bootstrapping has proven to be effective in transforming a conventional pipeline-based linguistic frontend to an integrated Sequence-to-Sequence (Seq2Seq) frontend for text-to-speech (TTS). However, for target accents with limited lexical coverage, the performance of bootstrapped Seq2Seq frontends would be greatly limited. In this work, we utilize multi-accent bootstrapping for rich-resource source accents and low-resource target accents to enable pronunciation knowledge transfer between them, effectively enlarging the lexical coverage of target accent. We formally analyze the effect of transfer between 3 English accents (word accuracy increase of 12%–17% absolute for transferred words) and how it scales with the number of annotated unique word types in the target accent. When annotating as few as 1k word types for the target accent, the transfer achieves a word accuracy of 81% for transferred words, approaching the generalisation ability of a baseline annotating 51k word types.
Original languageEnglish
Title of host publicationInterspeech 2024
PublisherInternational Speech Communication Association (ISCA)
Pages1-5
Number of pages5
DOIs
Publication statusPublished - 1 Sept 2024
EventThe 25th Interspeech Conference - Kipriotis International Convention Center, Kos Island, Greece
Duration: 1 Sept 20245 Sept 2024
Conference number: 25
https://interspeech2024.org/

Publication series

NameInterspeech
PublisherInternational Speech Communication Association (ISCA)
ISSN (Electronic)2958-1796

Conference

ConferenceThe 25th Interspeech Conference
Abbreviated titleInterspeech 2024
Country/TerritoryGreece
CityKos Island
Period1/09/245/09/24
Internet address

Fingerprint

Dive into the research topics of 'Learning pronunciation from other accents via pronunciation knowledge transfer'. Together they form a unique fingerprint.

Cite this