Projects per year
Abstract / Description of output
Join cost calculation has so far dealt exclusively with acoustic speech parameters, and a large number of distance metrics have previously been tested in conjunction with a wide variety of acoustic parameterisations. In contrast, we propose here to calculate distance in articulatory space. The motivation for this is simple: physical constraints mean a human talker’s mouth cannot “jump” from one configuration to a different one, so smooth evolution of articulator positions would also seem desirable for a good candidate unit sequence. To test this, we built Festival Multisyn voices using a large articulatory-acoustic dataset. We first synthesised 460 TIMIT sentences and confirmed our articulatory join cost gives appreciably different unit sequences compared to the standard Multisyn acoustic join cost. A listening test (3 sets of 25 sentence pairs, 30 listeners) then showed our articulatory cost is preferred at a rate of 58% compared to the standard Multisyn acoustic join cost.
Original language | English |
---|---|
Title of host publication | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 5150-5154 |
Number of pages | 5 |
ISBN (Print) | 978-1-4799-9988-0 |
DOIs | |
Publication status | Published - 2016 |
Event | 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - China, Shanghai, China Duration: 20 Mar 2016 → 25 Mar 2016 https://www2.securecms.com/ICASSP2016/Default.asp |
Conference
Conference | 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 |
---|---|
Abbreviated title | ICASSP 2016 |
Country/Territory | China |
City | Shanghai |
Period | 20/03/16 → 25/03/16 |
Internet address |
Fingerprint
Dive into the research topics of 'Smooth talking: articulatory join costs for unit selection'. Together they form a unique fingerprint.Projects
- 2 Finished
Profiles
-
Korin Richmond
- School of Philosophy, Psychology and Language Sciences - Reader
- Institute of Language, Cognition and Computation
- Centre for Speech Technology Research
Person: Academic: Research Active