Abstract
We present an investigation of the relevance of simple conversational features as indicators of topic shifts in small-group meetings. Three proposals for representation of dialogue data are described, and their effectiveness assessed at detecting topic boundaries on a large section of the Augmented Multi-Party Interaction (AMI) corpus. These proposals consist in representing a speech event through combinations of features such as the lengths of vocalisations, pauses and speech overlaps in the immediate temporal context of the event. Results show that timing of vocalisations alone, within a 7-vocalisation window (3 on each side of the vocalisation under consideration), can be an effective predictor of topic boundaries, outperforming topic segmentation methods based on lexical features. Pause and overlap information on their own also yield comparably good segmentation accuracy, suggesting that simple methods could complement or even serve as alternatives to methods which require more demanding speech processing for meeting browsing.
Original language | English |
---|---|
Pages | 1369-1372 |
Number of pages | 4 |
Publication status | Published - 2010 |
Event | 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010 - Makuhari, Chiba, Japan Duration: 26 Sept 2010 → 30 Sept 2010 |
Conference
Conference | 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010 |
---|---|
Country/Territory | Japan |
City | Makuhari, Chiba |
Period | 26/09/10 → 30/09/10 |
Keywords / Materials (for Non-textual outputs)
- Conversational features
- Dialogue
- Meeting browsing
- Multi-party interaction
- Speech
- Topic segmentation