The Development of the AMI System for the Transcription of Speech in Meetings

Thomas Hain, Lukáš Burget, John Dines, Iain McCowan, Giulia Garau, Martin Karafiat, Mike Lincoln, Darren Moore, Vincent Wan, Roeland Ordelman, Steve Renals

Research output: Chapter in Book/Report/Conference proceeding › Chapter


The automatic processing of speech collected in conference-style meetings has attracted considerable interest, with several large-scale projects devoted to this area. This paper describes the development of a baseline automatic speech transcription system for meetings in the context of the AMI (Augmented Multiparty Interaction) project. We present several techniques important to the processing of this data and report performance in terms of word error rates (WERs). An important aspect of transcribing this data is the flexibility required in audio pre-processing: real-world systems have to deal with flexible input, for example from microphone arrays or from microphones placed randomly in a room. Automatic segmentation and microphone array processing techniques are described, and their effect on WERs is discussed. The system and its components presented in this paper yield competitive performance and form a baseline for future research in this domain.
Original language: English
Title of host publication: Machine Learning for Multimodal Interaction
Subtitle of host publication: Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers
Editors: Steve Renals, Samy Bengio
Place of Publication: Berlin, Heidelberg
Publisher: Springer Berlin Heidelberg
Number of pages: 13
ISBN (Electronic): 978-3-540-32550-5
ISBN (Print): 978-3-540-32549-9
Publication status: Published - 2006
Event: Second International Workshop (MLMI 2005) - Edinburgh, United Kingdom
Duration: 11 Jul 2005 → 13 Jul 2005

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer Berlin Heidelberg
ISSN (Print): 0302-9743


Workshop: Second International Workshop (MLMI 2005)
Country: United Kingdom
