CRF-based Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

Dong Wang, Simon King, Nick Evans, Raphael Troncy

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Out-of-vocabulary (OOV) terms present a significant challenge to spoken term detection (STD). This challenge, to a large extent, lies in the high degree of uncertainty in pronunciations of OOV terms. In previous work, we presented a stochastic pronunciation modeling (SPM) approach to compensate for this uncertainty. A shortcoming of our original work, however, is that the SPM was based on a joint-multigram model (JMM), which is suboptimal. In this paper, we propose to use conditional random fields (CRFs) for letter-to-sound conversion, which significantly improves quality of the predicted pronunciations. When applied to OOV STD, we achieve consider- able performance improvement with both a 1-best system and an SPM-based system.
Original languageEnglish
Title of host publicationInterspeech 2010
Subtitle of host publication11th Annual Conference of the International Speech Communication Association
PublisherInternational Speech Communication Association
Pages1668-1671
Number of pages4
ISBN (Print)1990-9772
Publication statusPublished - 1 Sept 2010

Fingerprint

Dive into the research topics of 'CRF-based Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection'. Together they form a unique fingerprint.

Cite this