Edinburgh Research Explorer

Speaker-Independent Mel-cepstrum Estimation from Articulator Movements Using D-vector Input

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Related Edinburgh Organisations

Open Access permissions

Open

Documents

  • Download as Adobe PDF

    Accepted author manuscript, 579 KB, PDF document

http://www.interspeech2020.org/uploadfile/pdf/Wed-3-4-4.pdf
Original languageEnglish
Title of host publicationInterspeech 2020 Proceedings
Place of PublicationShanghai, China
PublisherISCA
Pages3176-3180
DOIs
Publication statusPublished - 25 Oct 2020
EventInterspeech 2020 - Virtual Conference, China
Duration: 25 Oct 202029 Oct 2020
http://www.interspeech2020.org/

Publication series

Name
ISSN (Print)1990-9772

Conference

ConferenceInterspeech 2020
Abbreviated titleINTERSPEECH 2020
CountryChina
CityVirtual Conference
Period25/10/2029/10/20
Internet address

Abstract

We describe a speaker-independent mel-cepstrum estimation system which accepts electromagnetic articulography (EMA) data as input. The system collects speaker information with d-vectors generated from the EMA data. We have also investigated the effect of speaker independence in the input vectors given to the mel-cepstrum estimator. This is accomplished by introducing a two-stage network, where the first stage is trained to output EMA sequences that are averaged across all speakers on a per-triphone basis (and so are speaker-independent) and the second receives these as input for mel-cepstrum estimation. Experimental results show that using the d-vectors can improve the performance of mel-cepstrum estimation by 0.19 dB with regard to mel-cepstrum distortion in the closed-speaker test set. Additionally, giving triphone-averaged EMA data to a mel-cepstrum estimator is shown to improve the performance by a further 0.16 dB, which indicates that the speaker-independent input has a positive effect on mel-cepstrum estimation.

Event

Interspeech 2020

25/10/2029/10/20

Virtual Conference, China

Event: Conference

ID: 171894393