Projects per year
Abstract / Description of output
When a text-to-speech (TTS) system is required to speak world news, a large fraction of the words to be spoken will be proper names originating in a wide variety of languages. Phonetization of these names based on target language letter-to-sound rules will typically be inadequate. This is detrimental not only during synthesis, when inappropriate phone sequences are produced, but also during training, if the system is trained on data from the same domain. This is because poor phonetization during forced alignment based on hidden Markov models can pollute the whole model set, resulting in degraded alignment even of normal target-language words. This paper presents four techniques designed to address this issue in the context of a Swahili TTS system: automatic transcription of proper names based on a lexicon from a better-resourced language; the addition of a parallel phone set and special part-of-speech tag exclusively dedicated to proper names; a manually-crafted phone mapping which allows substitutions for potentially more accurate phones in proper names during forced alignment; the addition in proper names of a grapheme-derived frame-level feature, supplementing the standard phonetic inputs to the acoustic model. We present results from objective and subjective evaluations of systems built using these four techniques.
Original language | English |
---|---|
Title of host publication | Proceedings Interspeech 2017 |
Pages | 2188-2192 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 24 Aug 2017 |
Event | Interspeech 2017 - Stockholm, Sweden Duration: 20 Aug 2017 → 24 Aug 2017 http://www.interspeech2017.org/ |
Publication series
Name | |
---|---|
Publisher | ISCA |
ISSN (Electronic) | 1990-9772 |
Conference
Conference | Interspeech 2017 |
---|---|
Country/Territory | Sweden |
City | Stockholm |
Period | 20/08/17 → 24/08/17 |
Internet address |
Fingerprint
Dive into the research topics of 'Nativization of foreign names in TTS for automatic reading of world news in Swahili'. Together they form a unique fingerprint.Projects
- 2 Finished
-
SCRIPT : Speech Synthesis for Spoken Content Production
Yamagishi, J., King, S. & Watts, O.
1/12/16 → 30/11/19
Project: Research
-
PPLS Pilot Project: Proof-of-concept hybrid speech synthesis system, for the BBC World Service.
1/07/16 → 31/07/16
Project: University Awarded Project Funding