Abstract / Description of output
Being able to detect topics and speaker stances in conversations is a key requirement for developing spoken language understanding systems that are personalized and adaptive. In this work, we explore how topic-oriented speaker stance is expressed in conversational speech. To do this, we present a new set of topic and stance annotations of the CallHome corpus of spontaneous dialogues. Specifically, we focus on six stances—positivity, certainty, surprise, amusement, interest, and comfort—which are useful for characterizing important aspects of a conversation, such as whether a conversation is going well or not. Based on this, we investigate the use of neural network models for automatically detecting speaker stance from speech in multi-turn, multi-speaker contexts. In particular, we examine how performance changes depending on how input feature representations are constructed and how this is related to dialogue structure. Our experiments show that incorporating both lexical and acoustic features is beneficial for stance detection. However, we observe variation in whether using hierarchical models for encoding lexical and acoustic information improves performance, suggesting that some aspects of speaker stance are expressed more locally than others. Overall, our findings highlight the importance of modelling interaction dynamics and non-lexical content for stance detection.
Original language | English |
---|---|
Title of host publication | Proceedings of Interspeech 2019 |
Publisher | International Speech Communication Association |
Pages | 46-50 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 19 Sept 2019 |
Event | Interspeech 2019 - Graz, Austria Duration: 15 Sept 2019 → 19 Sept 2019 https://www.interspeech2019.org/ |
Publication series
Name | Interspeech |
---|---|
Publisher | International Speech Communication Association |
ISSN (Electronic) | 1990-9772 |
Conference
Conference | Interspeech 2019 |
---|---|
Country/Territory | Austria |
City | Graz |
Period | 15/09/19 → 19/09/19 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- spoken language understanding
- affective computing
- stance
- computational paralinguistics
- spoken dialogue
Fingerprint
Dive into the research topics of 'Detecting Topic-Oriented Speaker Stance in Conversational Speech'. Together they form a unique fingerprint.Profiles
-
Catherine Lai
- School of Philosophy, Psychology and Language Sciences - Lecturer in Speech and Language Processing
- Institute of Language, Cognition and Computation
- Centre for Speech Technology Research
Person: Academic: Research Active
-