Abstract / Description of output
Ultrasound tongue imaging (UTI) provides a convenient way to visualize the vocal tract during speech production. UTI is increasingly being used for speech therapy, making it important to develop automatic methods to assist various time-consuming manual tasks currently performed by speech therapists. A key challenge is to generalize the automatic processing of ultrasound tongue images to previously unseen speakers. In this work, we investigate the classification of phonetic segments (tongue shapes) from raw ultrasound recordings under several training scenarios: speaker-dependent, multi-speaker, speaker-independent, and speaker-adapted. We observe that models underperform when applied to data from speakers not seen at training time. However, when provided with minimal additional speaker information, such as the mean ultrasound frame, the models generalize better to unseen speakers.
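As a rough illustration of two ideas mentioned in the abstract, the sketch below shows one possible way to hold out an unseen speaker (the speaker-independent scenario) and to pair each frame with its speaker's mean ultrasound frame as additional input. It is not the authors' implementation: the function names, the nested-list frame layout, the toy values, and the labels `class_a` and `class_b` are all assumptions made for this example.

```python
from collections import defaultdict


def mean_frame(frames):
    """Element-wise mean of equally sized 2-D frames (lists of rows)."""
    n = len(frames)
    rows, cols = len(frames[0]), len(frames[0][0])
    return [[sum(f[r][c] for f in frames) / n for c in range(cols)]
            for r in range(rows)]


def add_speaker_channel(samples):
    """Pair every frame with the mean frame of its speaker.

    samples: list of (speaker_id, frame, label) tuples (hypothetical layout).
    Returns a list of (speaker_id, [frame, speaker_mean], label) tuples.
    """
    by_speaker = defaultdict(list)
    for speaker, frame, _ in samples:
        by_speaker[speaker].append(frame)
    means = {s: mean_frame(fs) for s, fs in by_speaker.items()}
    return [(s, [f, means[s]], y) for s, f, y in samples]


def speaker_independent_split(samples, held_out):
    """Keep every sample from `held_out` out of the training set."""
    train = [x for x in samples if x[0] != held_out]
    test = [x for x in samples if x[0] == held_out]
    return train, test


# Toy data: 1x2 "frames" with placeholder labels (not from the paper).
samples = [
    ("spk1", [[0.0, 2.0]], "class_a"),
    ("spk1", [[2.0, 4.0]], "class_b"),
    ("spk2", [[10.0, 10.0]], "class_a"),
]
augmented = add_speaker_channel(samples)
train, test = speaker_independent_split(augmented, held_out="spk2")
```

In this sketch the mean frame of a held-out speaker is computed from that speaker's own unlabeled frames, which is one reading of "minimal additional speaker information"; how the mean frame is actually fed to the model is not specified in the abstract.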
Original language | English |
---|---|
Title of host publication | ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Place of Publication | Brighton, United Kingdom |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 1328-1332 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-4799-8131-1 |
ISBN (Print) | 978-1-4799-8132-8 |
DOIs | |
Publication status | Published - 17 Apr 2019 |
Event | 44th International Conference on Acoustics, Speech, and Signal Processing: Signal Processing: Empowering Science and Technology for Humankind, Brighton, United Kingdom. Duration: 12 May 2019 → 17 May 2019. Conference number: 44. https://2019.ieeeicassp.org/ |
Publication series
Name | |
---|---|
Publisher | IEEE |
ISSN (Print) | 1520-6149 |
ISSN (Electronic) | 2379-190X |
Conference
Conference | 44th International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP 2019 |
Country/Territory | United Kingdom |
City | Brighton |
Period | 12/05/19 → 17/05/19 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- ultrasound
- ultrasound tongue imaging
- speaker independent
- speech therapy
- child speech
Fingerprint
Dive into the research topics of 'Speaker-Independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech'. Together they form a unique fingerprint.
Projects
- 1 Finished
- Ultrax2020: Ultrasound Technology for Optimising the Treatment of Speech Disorders
  1/08/17 → 30/11/21
  Project: Research