Edinburgh Research Explorer

Handling overlaps in spoken term detection

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution


    Rights statement: Wang, D., Evans, N., Troncy, R., & King, S. (2011). Handling overlaps in spoken term detection. In Proc. International Conference on Acoustics, Speech and Signal Processing. (pp. 5656-5659). doi: 10.1109/ICASSP.2011.5947643

    Accepted author manuscript, 102 KB, PDF document

Original language: English
Title of host publication: Proc. International Conference on Acoustics, Speech and Signal Processing
Number of pages: 4
Publication status: Published - 1 May 2011


Spoken term detection (STD) systems usually produce many overlapping detections, which are often handled with pragmatic approaches, e.g. choosing the best detection to represent all the overlaps. In this paper we present a theoretical study based on the concept of an acceptance space. In particular, we present two confidence estimation approaches, based on Bayesian and evidence perspectives respectively. Analysis shows that each approach has its own advantages and shortcomings, and that their combination has the potential to provide improved confidence estimation. Experiments conducted on meeting data confirm our analysis and show considerable performance improvement with the combined approach, in particular for out-of-vocabulary spoken term detection with stochastic pronunciation modeling.
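
For illustration only, the pragmatic baseline mentioned in the abstract (letting the best detection represent all of its overlaps) might be sketched in Python as below. This is not the confidence estimation method proposed in the paper. The `Detection` record, its fields, and the function names are hypothetical, and the greedy highest-score-first rule is an assumed way of deciding which detection is "best".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """Hypothetical record for one putative occurrence of a search term."""
    term: str
    start: float  # start time in seconds
    end: float    # end time in seconds
    score: float  # detector confidence, higher is better


def overlaps(a: Detection, b: Detection) -> bool:
    """True if two detections of the same term share any time span."""
    return a.term == b.term and a.start < b.end and b.start < a.end


def keep_best_per_overlap(detections):
    """Greedy selection: visit detections from highest to lowest score and
    keep one only if it does not overlap an already kept detection."""
    kept = []
    for det in sorted(detections, key=lambda d: d.score, reverse=True):
        if not any(overlaps(det, k) for k in kept):
            kept.append(det)
    # Return the survivors in chronological order.
    return sorted(kept, key=lambda d: d.start)


if __name__ == "__main__":
    candidates = [
        Detection("edinburgh", 1.0, 1.6, 0.7),
        Detection("edinburgh", 1.1, 1.7, 0.9),
        Detection("edinburgh", 5.0, 5.5, 0.4),
    ]
    for det in keep_best_per_overlap(candidates):
        print(det)
```

A selection rule of this kind discards whatever information the dropped overlapping detections carry, which is the sort of limitation that motivates studying overlaps more carefully.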


ID: 2077827