SVitchboard 1: Small Vocabulary Tasks from Switchboard 1

Simon King, Chris Bartels, Jeff Bilmes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present a conversational telephone speech data set designed to support research on novel acoustic models. Small vocabulary tasks from 10 words up to 500 words are defined using subsets of the Switchboard-1 corpus; each task has a completely closed vocabulary (an OOV rate of 0. We justify the need for these tasks, de- scribe the algorithm for selecting them from a large cor- pus, give a statistical analysis of the data and present baseline whole-word hidden Markov model recognition results. The goal of the paper is to define a common data set and to encourage other researchers to use it.
Original languageEnglish
Title of host publicationInterspeech 2005 - Eurospeech
Subtitle of host publication9th European Conference on Speech Communication and Technology
PublisherInternational Speech Communication Association
Pages3385-3388
Number of pages4
ISBN (Print)1990-9772
Publication statusPublished - 2005

Fingerprint

Dive into the research topics of 'SVitchboard 1: Small Vocabulary Tasks from Switchboard 1'. Together they form a unique fingerprint.

Cite this