Projects per year
Abstract
We implement two deep architectures for the acoustic-articulatory inversion mapping problem: a deep neural network and a deep trajectory mixture density network. We find that in both cases, deep architectures produce more accurate predictions than shallow architectures and that this is due to the higher expressive capability of a deep model and not a consequence of adding more adjustable parameters. We also find that a deep trajectory mixture density network is able to obtain better inversion accuracies than smoothing the results of a deep neural network. Our best model obtained an average root mean square error of 0.885 mm on the MNGU0 test dataset.
Original language | English |
---|---|
Title of host publication | INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association |
Publisher | ISCA |
Pages | 867-870 |
Number of pages | 4 |
Publication status | Published - Sept 2012 |
Fingerprint
Dive into the research topics of 'Deep Architectures for Articulatory Inversion'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Deep Learning for Speech Recognition
Murray, I. (Principal Investigator) & Renals, S. (Co-investigator)
UK industry, commerce and public corporations
1/10/11 → 31/03/15
Project: Research
-
ULTRAX: Ultrax: Real-time tongue tracking for speech therapy using ultrasound
Richmond, K. (Principal Investigator), Renals, S. (Co-investigator), Cleland, J. (Co-Investigator (External)) & Scobbie, J. M. (Co-Investigator (External))
1/02/11 → 31/07/14
Project: Research