Cross-lingual RST Discourse Parsing

Chloe Braud, Maximin Coavoux, Anders Søgaard

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Discourse parsing is an integral part of understanding information flow and argumentative structure in documents. Most previous research has focused on inducing and evaluating models from the English RST Discourse Treebank. However, discourse treebanks for other languages exist, including Spanish, German, Basque, Dutch and Brazilian Portuguese. The treebanks share the same underlying linguistic theory, but differ slightly in the way documents are annotated. In this paper, we present (a) a new discourse parser which is simpler, yet competitive (significantly better on 2/3 metrics) to state of the art for English, (b) a harmonization of discourse treebanks across languages, enabling us to present (c) what to the best of our knowledge are the first experiments on crosslingual discourse parsing.
Original languageEnglish
Title of host publication15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1
PublisherAssociation for Computational Linguistics (ACL)
Number of pages13
Publication statusPublished - 1 Apr 2017
EventThe 15th Conference of the European Chapter of the Association for Computational Linguistics - Valencia, Spain
Duration: 3 Apr 20177 Apr 2017

Conference

ConferenceThe 15th Conference of the European Chapter of the Association for Computational Linguistics
Abbreviated titleEACL 2017
Country/TerritorySpain
CityValencia
Period3/04/177/04/17

Fingerprint

Dive into the research topics of 'Cross-lingual RST Discourse Parsing'. Together they form a unique fingerprint.

Cite this