Modelling the Uncertainty in Recovering Articulation from Acoustics

K. Richmond, S. King, Paul Taylor

Research output: Contribution to journal › Article › peer-review


This paper presents an experimental comparison of the performance of the multilayer perceptron (MLP) with that of the mixture density network (MDN) for an acoustic-to-articulatory mapping task. A corpus of acoustic-articulatory data recorded by electromagnetic articulography (EMA) for a single speaker was used as training and test data for this purpose. In theory, the MDN is able to provide a richer, more flexible description of the target variables in response to a given input vector than the least-squares trained MLP. Our results show that the mean likelihoods of the target articulatory parameters for an unseen test set were indeed consistently higher with the MDN than with the MLP. The increase ranged from approximately 3% to 22%, depending on the articulatory channel in question. On the basis of these results, we argue that using a more flexible description of the target domain, such as that offered by the MDN, can prove beneficial when modelling the acoustic-to-articulatory mapping.
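As a rough illustration of why a mixture output can score a higher mean likelihood than a least-squares model, the sketch below compares a single Gaussian with a two-component Gaussian mixture on a handful of targets. It is not the authors' implementation: the target values, mixture weights, means, and standard deviations are all hypothetical, and no network is trained. The single Gaussian stands in for the least-squares MLP, whose training objective corresponds to a unimodal Gaussian with fixed variance around the predicted mean; the mixture stands in for the kind of density an MDN could output for one input vector.

```python
import math


def gaussian_pdf(t, mean, std):
    """Density of a univariate Gaussian at t."""
    z = (t - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def mixture_pdf(t, weights, means, stds):
    """Density of a univariate Gaussian mixture at t."""
    return sum(w * gaussian_pdf(t, m, s) for w, m, s in zip(weights, means, stds))


def mean_likelihood(targets, pdf):
    """Average of the density values assigned to the observed targets."""
    return sum(pdf(t) for t in targets) / len(targets)


# Hypothetical targets for one articulatory channel: two plausible
# articulator positions for acoustically similar inputs (bimodal).
targets = [-1.1, -0.9, -1.0, 1.0, 0.9, 1.1]

# Single-Gaussian description: mean and spread fitted to the targets,
# standing in for a least-squares model with one global variance.
mean = sum(targets) / len(targets)
std = math.sqrt(sum((t - mean) ** 2 for t in targets) / len(targets))
single_score = mean_likelihood(targets, lambda t: gaussian_pdf(t, mean, std))

# Mixture description: hypothetical parameters with one component per mode.
weights, means, stds = [0.5, 0.5], [-1.0, 1.0], [0.1, 0.1]
mixture_score = mean_likelihood(
    targets, lambda t: mixture_pdf(t, weights, means, stds)
)

print(f"single Gaussian mean likelihood: {single_score:.3f}")
print(f"mixture mean likelihood:         {mixture_score:.3f}")
```

On data like this, the single Gaussian places its mean between the two modes, where no targets lie, so the mixture assigns the observed targets higher density. The size of the gap here reflects the invented numbers only and says nothing about the 3% to 22% range reported in the paper.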
Original language: English
Pages (from-to): 153-172
Number of pages: 20
Journal: Computer Speech and Language
Issue number: 2-3
Publication status: Published - Apr 2003

