The Penn Discourse TreeBank 2.0.

Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, Bonnie Webber

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract / Description of output

We present the second version of the Penn Discourse Treebank, PDTB-2.0, describing its lexically-grounded annotations of discourse relations and their two abstract object arguments over the 1 million word Wall Street Journal corpus. We describe all aspects of the annotation, including (a) the argument structure of discourse relations, (b) the sense annotation of the relations, and (c) the attribution of discourse relations and each of their arguments. We list the differences between PDTB-1.0 and PDTB-2.0. We present representative statistics for several aspects of the annotation in the corpus.
Original language: English
Title of host publication: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Editors: Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias
Place of publication: Marrakech, Morocco
Publisher: European Language Resources Association (ELRA)
Pages: 2961-2968
Number of pages: 8
ISBN (Print): 2-9517408-4-0
Publication status: Published - 1 May 2008
