SVitchboard 1: Small Vocabulary Tasks from Switchboard 1

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Open Access permissions: Open

Documents

  • Accepted author manuscript, 79 KB, PDF document
    Licence: Creative Commons: Attribution No Derivatives (CC-BY-ND)

  • Accepted author manuscript, 213 KB, PDF document
    Licence: Creative Commons: Attribution No Derivatives (CC-BY-ND)

Original language: English
Title of host publication: Interspeech 2005 - Eurospeech
Subtitle of host publication: 9th European Conference on Speech Communication and Technology
Publisher: International Speech Communication Association
Pages: 3385-3388
Number of pages: 4
ISBN (Print): 1990-9772
Publication status: Published - 2005

Abstract

We present a conversational telephone speech data set designed to support research on novel acoustic models. Small vocabulary tasks from 10 words up to 500 words are defined using subsets of the Switchboard-1 corpus; each task has a completely closed vocabulary (an OOV rate of 0%). We justify the need for these tasks, describe the algorithm for selecting them from a large corpus, give a statistical analysis of the data and present baseline whole-word hidden Markov model recognition results. The goal of the paper is to define a common data set and to encourage other researchers to use it.

ID: 2076819