Abstract
Discourse relations can either be implicit or explicitly expressed by markers, such as ’therefore’ and ’but’. How a speaker makes this choice is a question that is not well understood. We propose a psycholinguistic model that predicts whether a speaker will produce an explicit marker given the discourse relation s/he wishes to express. Based on the framework of the Rational Speech Acts model, we quantify the utility of producing a marker based on the information-theoretic measure of surprisal, the cost of production, and a bias to maintain uniform information density throughout the utterance. Experiments based on the Penn Discourse Treebank show that our approach outperforms state of-the-art approaches, while giving an explanatory account of the speaker’s choice.
Original language | English |
---|---|
Title of host publication | Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL) |
Place of Publication | Berlin, Germany |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 302–313 |
Number of pages | 12 |
ISBN (Electronic) | 978-1-945626-19-7 |
DOIs | |
Publication status | Published - 12 Aug 2016 |
Event | 20th SIGNLL Conference on Computational Natural Language Learning - Berlin, Germany Duration: 11 Aug 2016 → 12 Aug 2016 http://www.conll.org/2016 |
Conference
Conference | 20th SIGNLL Conference on Computational Natural Language Learning |
---|---|
Abbreviated title | CoNLL |
Country/Territory | Germany |
City | Berlin |
Period | 11/08/16 → 12/08/16 |
Internet address |