Interface design strategies for computer-assisted speech transcription

Saturnino Luz*, Masood Masoodian, Bill Rogers, Chris Deering

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A set of user interface design techniques for computer-assisted speech transcription are presented and evaluated with respect to task performance and usability. These techniques include error-correction mechanisms which originated in dictation systems and audio editors as well as new techniques developed by us which exploit specific characteristics of existing speech recognition technologies in order to facilitate transcription in settings that typically yield considerable recognition inaccuracy, such as when the speech to be transcribed was produced by different speakers. In particular, we describe a mechanism for dynamic propagation of user feedback which progressively adapts the system to different speakers and lexical contexts. Results of usability and performance evaluation trials indicate that feedback propagation, menu-based correction coupled with keyboard interaction and text-driven audio playback are positively perceived by users and result in improved transcript accuracy.

Original languageEnglish
Title of host publicationProceedings of the 20th Australasian Conference on Computer-Human Interaction
Subtitle of host publicationDesigning for Habitus and Habitat, OZCHI'08
Pages203-210
Number of pages8
DOIs
Publication statusPublished - Dec 2008
Event20th Australasian Conference on Computer-Human Interaction: Designing for Habitus and Habitat, OZCHI'08 - Cairns, QLD, Australia
Duration: 8 Dec 200812 Dec 2008

Publication series

NameProceedings of the 20th Australasian Conference on Computer-Human Interaction: Designing for Habitus and Habitat, OZCHI'08

Conference

Conference20th Australasian Conference on Computer-Human Interaction: Designing for Habitus and Habitat, OZCHI'08
Country/TerritoryAustralia
CityCairns, QLD
Period8/12/0812/12/08

Keywords / Materials (for Non-textual outputs)

  • Automatic speech recognition
  • Computer-assisted speech transcription
  • Error correction

Fingerprint

Dive into the research topics of 'Interface design strategies for computer-assisted speech transcription'. Together they form a unique fingerprint.

Cite this