Pronunciation variation in ASR: which variation to model?

Mirjam Wester, Judith M. Kessens, Helmer Strik

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution


This paper describes how the performance of a continuous speech recognizer for Dutch was improved by modeling within-word and cross-word pronunciation variation. Compared to the baseline system, a relative improvement of 8.8% in word error rate (WER) was found. However, because WERs do not reveal the full effect of modeling pronunciation variation, we performed a detailed analysis of the differences in recognition results that arise from modeling pronunciation variation, and found that many of these differences are indeed not reflected in the error rates. Furthermore, error analysis revealed that testing sets of variants in isolation does not predict their behavior in combination. However, these results appeared to be corpus dependent.
Original language: English
Title of host publication: Sixth International Conference on Spoken Language Processing, ICSLP 2000 / INTERSPEECH 2000, Beijing, China, October 16-20, 2000
Number of pages: 4
Publication status: Published - 2000
