Projects per year
Abstract / Description of output
Bootstrapping has proven to be effective in transforming a conventional pipeline-based linguistic frontend to an integrated Sequence-to-Sequence (Seq2Seq) frontend for text-to-speech (TTS). However, for target accents with limited lexical coverage, the performance of bootstrapped Seq2Seq frontends would be greatly limited. In this work, we utilize multi-accent bootstrapping for rich-resource source accents and low-resource target accents to enable pronunciation knowledge transfer between them, effectively enlarging the lexical coverage of target accent. We formally analyze the effect of transfer between 3 English accents (word accuracy increase of 12%–17% absolute for transferred words) and how it scales with the number of annotated unique word types in the target accent. When annotating as few as 1k word types for the target accent, the transfer achieves a word accuracy of 81% for transferred words, approaching the generalisation ability of a baseline annotating 51k word types.
Original language | English |
---|---|
Title of host publication | Interspeech 2024 |
Publisher | International Speech Communication Association (ISCA) |
Pages | 1-5 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 1 Sept 2024 |
Event | The 25th Interspeech Conference - Kipriotis International Convention Center, Kos Island, Greece Duration: 1 Sept 2024 → 5 Sept 2024 Conference number: 25 https://interspeech2024.org/ |
Publication series
Name | Interspeech |
---|---|
Publisher | International Speech Communication Association (ISCA) |
ISSN (Electronic) | 2958-1796 |
Conference
Conference | The 25th Interspeech Conference |
---|---|
Abbreviated title | Interspeech 2024 |
Country/Territory | Greece |
City | Kos Island |
Period | 1/09/24 → 5/09/24 |
Internet address |
Fingerprint
Dive into the research topics of 'Learning pronunciation from other accents via pronunciation knowledge transfer'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Two studentships in Natural Language Processing
Non-EU industry, commerce and public corporations
1/09/20 → 31/08/24
Project: Research