This paper presents a comparison between a hidden Markov model (HMM) based method and a novel artificial neural network (ANN) based method for lip synchronisation. Both model types were trained on motion tracking data, and a perceptual evaluation was carried out comparing the output of the models, both to each other and to the original tracked data. It was found that the ANN-based method was judged significantly better than the HMM based method. Furthermore, the original data was not judged significantly better than the output of the ANN method.
|Title of host publication||INTERSPEECH 2010 11th Annual Conference of the International Speech Communication Association|
|Publisher||International Speech Communication Association|
|Number of pages||4|
|Publication status||Published - Sep 2010|