Projects per year
Abstract / Description of output
Global style tokens (GSTs) allow for rich modelling of the variation in a speech corpus and subsequent control of text-to-speech synthesis (TTS). However, certain styles of speech may be marked by variation along multiple dimensions, complicating the interpretation and control of learned style tokens. One example is hyperarticulated or ‘clear’ speech, for example as directed toward listeners with hearing impairments or language learners in the classroom, which in English is characterised by reduced speaking rate, increased F0, more careful articulation of vowels and plosive consonants, and other factors. We present a method for simplifying control of style tokens by applying principal components analysis (PCA) to GST weights from a TTS system trained on both plain and clear speech. We identify the axes of variation in PCA space with the acoustic correlates of clear speech in English and show that we can synthesise either style by moving along a single dimension in that space.
Original language | English |
---|---|
Title of host publication | Interspeech 2024 |
Publisher | International Speech Communication Association (ISCA) |
Pages | 1-5 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 1 Sept 2024 |
Event | The 25th Interspeech Conference - Kipriotis International Convention Center, Kos Island, Greece Duration: 1 Sept 2024 → 5 Sept 2024 Conference number: 25 https://interspeech2024.org/ |
Publication series
Name | Interspeech |
---|---|
Publisher | International Speech Communication Association (ISCA) |
ISSN (Electronic) | 2958-1796 |
Conference
Conference | The 25th Interspeech Conference |
---|---|
Abbreviated title | Interspeech 2024 |
Country/Territory | Greece |
City | Kos Island |
Period | 1/09/24 → 5/09/24 |
Internet address |
Fingerprint
Dive into the research topics of 'Low-dimensional style token control for hyperarticulated speech synthesis'. Together they form a unique fingerprint.Projects
- 1 Active
Activities
- 1 Hosting an academic visitor
-
Miku Nishihara
Korin Richmond (Host)
7 Jun 2023 → 12 Sept 2023Activity: Hosting a visitor types › Hosting an academic visitor