Edinburgh Research Explorer

Joint Prosodic and Segmental Unit Selection Speech Synthesis

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Open Access permissions: Open

Documents

    Rights statement: © Clark, R. A. J., & King, S. (2006). Joint Prosodic and Segmental Unit Selection Speech Synthesis. In Interspeech 2006 - ICSLP: 9th International Conference on Spoken Language Processing (paper 1262). International Speech Communication Association.

    Accepted author manuscript, 48 KB, PDF document

Original language: English
Title of host publication: Interspeech 2006 - ICSLP
Subtitle of host publication: 9th International Conference on Spoken Language Processing
Publisher: International Speech Communication Association
ISBN (Print): 1990-9772
Publication status: Published - 1 Sep 2006

Abstract

We describe a unit selection technique for text-to-speech synthesis which jointly searches the space of possible diphone sequences and the space of possible prosodic unit sequences in order to produce synthetic speech with more natural prosody. We demonstrate that this search, although currently computationally expensive, can achieve improved intonation compared to a baseline in which only the space of possible diphone sequences is searched. We discuss ways in which the search could be made sufficiently efficient for use in a real-time system.
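
The joint search described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not give the cost functions or the form of the prosodic units, so everything here is an assumption made for illustration. Segmental candidates are hypothetical (label, f0) pairs, prosodic candidates are plain f0 targets, and the helper names viterbi, joint_search, seg_join, pros_join and mismatch are invented. The sketch runs a standard Viterbi search over the product of the two candidate spaces, which also shows why the joint search is expensive: each position has (segmental candidates × prosodic candidates) states.

```python
from itertools import product


def viterbi(candidates, target_cost, join_cost):
    """Return (total_cost, path) for the cheapest choice of one candidate per position."""
    # best maps each candidate at the current position to the cheapest
    # (cost, path) of any sequence ending in that candidate.
    best = {c: (target_cost(0, c), [c]) for c in candidates[0]}
    for t in range(1, len(candidates)):
        new_best = {}
        for c in candidates[t]:
            prev_cost, prev_path = min(
                ((cost + join_cost(p, c), path) for p, (cost, path) in best.items()),
                key=lambda item: item[0],
            )
            new_best[c] = (prev_cost + target_cost(t, c), prev_path + [c])
        best = new_best
    return min(best.values(), key=lambda item: item[0])


def joint_search(seg_cands, pros_cands, seg_join, pros_join, mismatch):
    """Viterbi over (segmental, prosodic) pairs instead of segmental units alone."""
    joint = [list(product(s, p)) for s, p in zip(seg_cands, pros_cands)]

    def target(t, pair):
        # Assumed cost: how badly the segmental unit fits the prosodic unit.
        return mismatch(pair[0], pair[1])

    def join(a, b):
        # Assumed cost: segmental and prosodic join costs simply add.
        return seg_join(a[0], b[0]) + pros_join(a[1], b[1])

    return viterbi(joint, target, join)


# Toy data (hypothetical): two positions, two candidates in each space.
seg_cands = [[("a1", 100), ("a2", 140)], [("b1", 110), ("b2", 180)]]
pros_cands = [[100, 140], [110, 130]]


def seg_join(a, b):
    return abs(a[1] - b[1])      # f0 discontinuity between diphones


def pros_join(a, b):
    return abs(a - b)            # jump between prosodic targets


def mismatch(s, p):
    return abs(s[1] - p)         # diphone f0 versus prosodic target


# Baseline: search the segmental space only, with no prosodic term.
baseline_cost, baseline_path = viterbi(seg_cands, lambda t, c: 0, seg_join)

# Joint: search both spaces at once.
joint_cost, joint_path = joint_search(seg_cands, pros_cands, seg_join, pros_join, mismatch)
print(baseline_cost, baseline_path)
print(joint_cost, joint_path)
```

With N segmental and M prosodic candidates per position, the joint lattice has N × M states per position and roughly (N × M)² transitions between adjacent positions, against N² for the baseline. Pruning the candidate lists or beam search are common ways to reduce this cost in general; whether they match the efficiency measures the authors discuss is not stated in the abstract.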

ID: 2077094