Abstract
This paper presents a novel data-driven approach to summarizing spoken audio transcripts utilizing lexical and prosodic features. The former are obtained from a speech recognizer and the latter are extracted automatically from speech waveforms. We employ a feature subset selection algorithm, based on ROC curves, which examines different combinations of features at different target operating conditions. The approach is evaluated on the IBM Voicemail corpus, demonstrating that it is possible and desirable to avoid complete commitment to a single best classifier or feature set.
Original language | English |
---|---|
Title of host publication | 7th European Conference on Speech Communication and Technology |
Subtitle of host publication | Eurospeech 2001 Scandinavia |
Editors | Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan |
Place of Publication | Aalborg, Denmark |
Publisher | Kommunik Grafiske Løsninger |
Pages | 2377-2380 |
ISBN (Print) | 87-90834-10-0 |
Publication status | Published - 2001 |
Event | 7th European Conference on Speech Communication and Technology (Eurospeech 2001 Scandinavia) - Aalborg Congress and Culture Centre, Aalborg, Denmark Duration: 3 Sep 2001 → 7 Sep 2001 |
Conference
Conference | 7th European Conference on Speech Communication and Technology (Eurospeech 2001 Scandinavia) |
---|---|
Country/Territory | Denmark |
City | Aalborg |
Period | 3/09/01 → 7/09/01 |