Stochastic pronunciation modelling for spoken term detection

D. Wang, S. King, J. Frankel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A major challenge faced by a spoken term detection (STD) system is the detection of out-of-vocabulary (OOV) terms. Although a subword-based STD system is able to detect OOV terms, performance reduction is always observed compared to in-vocabulary terms. Current approaches to STD do not acknowledge the particular properties of OOV terms, such as pronunciation uncertainty. In this paper, we use a stochastic pronunciation model to deal with the uncertain pronunciations of OOV terms. By considering all possible term pronunciations, predicted by a joint-multigram model, we observe a significant performance improvement.
Original languageEnglish
Title of host publicationProceedings of Interspeech 2009 Brighton
Pages2135-2138
Number of pages4
Publication statusPublished - 2009

Fingerprint Dive into the research topics of 'Stochastic pronunciation modelling for spoken term detection'. Together they form a unique fingerprint.

Cite this