This paper reports on present work, in which a recurrent neural network is trained to estimate 'velum height' during continuous speech. Parallel acoustic-articulatory data comprising more than 400 read TIMIT sentences is obtained using electromagnetic articulography (EMA). This data is processed and used as training data for a range of neural network sizes. The network demonstrating the highest accuracy is identified. This performance is then evaluated in detail by analysing the network's output for each phonetic segment contained in 50 hand-labelled utterances set aside for testing purposes.
|Title of host publication||Sixth European Conference on Speech Communication and Technology (EUROSPEECH'99)|
|Publisher||International Speech Communication Association|
|Number of pages||4|
|Publication status||Published - 1999|