Trajectory Mixture Density Networks With Multiple Mixtures for Acoustic-Articulatory Inversion

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We have previously proposed a trajectory model which is based on a mixture density network (MDN) trained with target variables augmented with dynamic features together with an algorithm for estimating maximum likelihood trajectories which respects the constraints between those features. In this paper, we have extended that model to allow diagonal covariance matrices and multiple mixture components in the trajectory MDN output probability density functions. We have evaluated this extended model on an inversion mapping task and found the trajectory model works well, outperforming smoothing of equivalent trajectories using low-pass filtering. Increasing the number of mixture components in the TMDN improves results further.
Original languageEnglish
Title of host publicationAdvances in Nonlinear Speech Processing, International Conference on Non-Linear Speech Processing, NOLISP 2007
EditorsM. Chetouani, A. Hussain, B. Gas, M. Milgram, J.-L. Zarader
PublisherSpringer
Pages263-272
Number of pages10
ISBN (Electronic)978-3-540-77347-4
ISBN (Print)978-3-540-77346-7
DOIs
Publication statusPublished - Dec 2007

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Berlin / Heidelberg
Volume4885
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'Trajectory Mixture Density Networks With Multiple Mixtures for Acoustic-Articulatory Inversion'. Together they form a unique fingerprint.

Cite this