Abstract / Description of output
Affective speech synthesis is an active research area, but recent approaches usually lack the fine-grained controllability needed to produce utterances with the exact affect the user intends. We propose a puppetry tool based on FastPitch that helps model output convey any required suprasegmental meaning. Users can choose any trained FastPitch model and select which features should be mimicked, making the approach fine-grained and language-independent.
Original language | English |
---|---|
Title of host publication | Proceedings of the Annual Conference of the International Speech Communication Association |
Publisher | ISCA |
Pages | 5219-5220 |
Number of pages | 2 |
Volume | 2022-September |
Publication status | Published - 22 Sept 2022 |
Event | Interspeech 2022 - Incheon, Korea, Republic of; Duration: 18 Sept 2022 → 22 Sept 2022; Conference number: 23; https://interspeech2022.org/ |
Publication series
Name | Interspeech |
---|---|
Publisher | ISCA |
ISSN (Electronic) | 2308-457X |
Conference
Conference | Interspeech 2022 |
---|---|
Country/Territory | Korea, Republic of |
City | Incheon |
Period | 18/09/22 → 22/09/22 |
Internet address | https://interspeech2022.org/ |
Keywords / Materials (for Non-textual outputs)
- speech synthesis
- voice puppetry