Abstract
Using the Methodius Natural Language Generation (NLG) System, we have created a corpus which consists of a collection of generated texts which describe ancient Greek artefacts. Each text is linked to two representations created as part of the NLG process. The first is a content plan, which uses rhetorical relations to describe the high-level discourse structure of the text, and the second is a logical form describing the syntactic structure, which is sent to the OpenCCG surface realization module to produce the final text output. In
recent work, White and Howcroft (2015) have used the SPaRKy restaurant corpus, which contains similar combination of texts and representations, for their research on the induction of rules for the combination of clauses. In the first instance this corpus will be used to test their algorithms on an additional domain, and extend their work to include the learning of referring expression generation rules. As far as we know, the SPaRKy restaurant corpus is the only existing corpus of this type, and we hope that the creation of this new corpus
in a different domain will provide a useful resource to the Natural Language Generation community.
recent work, White and Howcroft (2015) have used the SPaRKy restaurant corpus, which contains similar combination of texts and representations, for their research on the induction of rules for the combination of clauses. In the first instance this corpus will be used to test their algorithms on an additional domain, and extend their work to include the learning of referring expression generation rules. As far as we know, the SPaRKy restaurant corpus is the only existing corpus of this type, and we hope that the creation of this new corpus
in a different domain will provide a useful resource to the Natural Language Generation community.
Original language | English |
---|---|
Title of host publication | Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) |
Publisher | European Language Resources Association (ELRA) |
Pages | 1732-1736 |
Number of pages | 5 |
ISBN (Print) | 978-2-9517408-9-1 |
Publication status | Published - 13 May 2016 |
Event | 10th edition of the Language Resources and Evaluation Conference - Portorož , Slovenia Duration: 23 May 2016 → 28 May 2016 http://lrec2016.lrec-conf.org/en/ http://www.lrec-conf.org/proceedings/lrec2016/index.html |
Conference
Conference | 10th edition of the Language Resources and Evaluation Conference |
---|---|
Abbreviated title | LREC 2016 |
Country/Territory | Slovenia |
City | Portorož |
Period | 23/05/16 → 28/05/16 |
Internet address |