Abstract / Description of output
We present the second version of the Penn Discourse Treebank, PDTB-2.0, describing its lexically-grounded annotations of discourse relations and their two abstract object arguments over the 1-million-word Wall Street Journal corpus. We describe all aspects of the annotation, including (a) the argument structure of discourse relations, (b) the sense annotation of the relations, and (c) the attribution of discourse relations and each of their arguments. We list the differences between PDTB-1.0 and PDTB-2.0. We present representative statistics for several aspects of the annotation in the corpus.
Original language | English |
---|---|
Title of host publication | Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) |
Editors | Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias |
Place of Publication | Marrakech, Morocco |
Publisher | European Language Resources Association (ELRA) |
Pages | 2961-2968 |
Number of pages | 8 |
ISBN (Print) | 2-9517408-4-0 |
Publication status | Published - 1 May 2008 |