Abstract / Description of output

Affective speech synthesis is an active research area, but recent approaches usually lack the full, fine-grained controllability to produce utterances with any exact affect intended by the user. We propose a puppetry tool based on FastPitch to help model output convey any required suprasegmental meanings. Users can choose any trained FastPitch model, and which features should be mimicked, making the approach fine-grained and language-independent.
Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association
PublisherISCA
Pages5219-5220
Number of pages2
Volume2022-September
Publication statusPublished - 22 Sept 2022
EventInterspeech 2022 - Incheon, Korea, Republic of
Duration: 18 Sept 202222 Sept 2022
Conference number: 23
https://interspeech2022.org/

Publication series

NameInterspeech
PublisherISCA
ISSN (Electronic)2308-457X

Conference

ConferenceInterspeech 2022
Country/TerritoryKorea, Republic of
CityIncheon
Period18/09/2222/09/22
Internet address

Keywords / Materials (for Non-textual outputs)

  • speech synthesis
  • voice puppetry

Fingerprint

Dive into the research topics of 'Voice Puppetry with FastPitch'. Together they form a unique fingerprint.

Cite this