Abstract / Description of output
Multi-sentence questions (MSQs) are sequences of questions connected by relations which, unlike sequences of standalone questions, need to be answered as a unit. Following Rhetorical Structure Theory (RST), we recognise that different "question discourse relations'' between the subparts of MSQs reflect different speaker intents, and consequently elicit different answering strategies. Correctly identifying these relations is therefore a crucial step in automatically answering MSQs. We identify five different types of MSQs in English, and define five novel relations to describe them. We extract over 162,000 MSQs from Stack Exchange to enable future research. Finally, we implement a high-precision baseline classifier based on surface features.
Original language | English |
---|---|
Title of host publication | Proceedings of the 14th Linguistic Annotation Workshop |
Editors | Stefanie Dipper, Amir Zeldes |
Place of Publication | Barcelona, Spain |
Publisher | Association for Computational Linguistics |
Pages | 138-147 |
Number of pages | 10 |
ISBN (Electronic) | 9781952148330 |
Publication status | Published - 1 Dec 2020 |
Event | The 14th Linguistic Annotation Workshop - Barcelona, Spain Duration: 12 Dec 2020 → 12 Dec 2020 Conference number: 14 https://sigann.github.io/LAW-XIV-2020/ |
Workshop
Workshop | The 14th Linguistic Annotation Workshop |
---|---|
Abbreviated title | LAW XIV |
Country/Territory | Spain |
City | Barcelona |
Period | 12/12/20 → 12/12/20 |
Internet address |