This paper is about the evaluation of a system that generates short text summaries of voicemail messages, suitable for transmission as text messages. Our approach to summarization is based on a speech-recognized transcript of the voicemail message, from which a set of summary words is extracted. The system uses a classifier to identify the summary words, with each word being identified by a vector of lexical and prosodic features. The features are selected using Parcel, an ROC-based algorithm. Our evaluations of the system, using a slot error rate metric, have compared manual and automatic summarization, and manual and automatic recognition (using two different recognizers). We also report on two subjective evaluations using mean opinion score of summaries, and a set of comprehension tests. The main results from these experiments were that the perceived difference in quality of summarization was affected more by errors resulting from automatic transcription, than by the automatic summarization process.
|Title of host publication||2003 ISCA Workshop on Multilingual Spoken Document Retrieval, Proceedings of the|
|Subtitle of host publication||MSDR 2003|
|Publication status||Published - 2003|
|Event||2003 ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR2003) - Hong Kong, Hong Kong|
Duration: 4 Apr 2003 → 5 Apr 2003
|Workshop||2003 ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR2003)|
|Period||4/04/03 → 5/04/03|