We present a conversational telephone speech data set designed to support research on novel acoustic models. Small vocabulary tasks, ranging from 10 words up to 500 words, are defined using subsets of the Switchboard-1 corpus; each task has a completely closed vocabulary (an OOV rate of 0). We justify the need for these tasks, describe the algorithm for selecting them from a large corpus, give a statistical analysis of the data, and present baseline whole-word hidden Markov model recognition results. The goal of the paper is to define a common data set and to encourage other researchers to use it.
|Title of host publication||Interspeech 2005 - Eurospeech|
|Subtitle of host publication||9th European Conference on Speech Communication and Technology|
|Publisher||International Speech Communication Association|
|Number of pages||4|
|Publication status||Published - 2005|