Abstract
This paper proposes a trajectory model which is based on a mixture density network trained with target features augmented with dynamic features together with an algorithm for estimating maximum likelihood trajectories which respects constraints between the static and derived dynamic features. This model was evaluated on an inversion mapping task. We found the introduction of the trajectory model successfully reduced root mean square error by up to 7.5%, as well as increasing correlation scores.
Original language | English |
---|---|
Title of host publication | INTERSPEECH 2006 - ICSLP Ninth International Conference on Spoken Language Processing |
Publisher | International Speech Communication Association |
Publication status | Published - Sept 2006 |