Abstract
This aticle presents trainable methods for extracting principal content words from voicemail messages. The short text summaries generated are suitable for mobile messaging applications. The system uses a set of classifiers to identify the summary words with each word described by a vector of lexical and prosodic features. We use an ROC-based algorithm, Parcel, to select input features (and classifiers). We have performed a series of objective and subjective evaluations using unseen data from two different speech recognition systems as well as human transcriptions of voicemail speech.
Original language | English |
---|---|
Article number | 1 |
Number of pages | 21 |
Journal | ACM Transactions on Speech and Language Processing |
Volume | 2 |
Issue number | 1 |
DOIs | |
Publication status | Published - 1 Feb 2005 |
Keywords / Materials (for Non-textual outputs)
- Voicemail
- automatic summarization
- feature subset selection
- prosody
- receiver operating characteristic
- short message service