Validating the Web-based Evaluation of NLG Systems

Alexander Koller, Kristina Striegnitz, Donna Byron, Justine Cassell, Robert Dale, Sara Dalzel-Job, Jon Oberlander, Johanna Moore

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The GIVE Challenge is a recent shared task in which NLG systems are evaluated over the Internet. In this paper, we validate this novel NLG evaluation methodology by comparing the Internet-based results with results we collected in a lab experiment. We find that the results delivered by both methods are consistent, but the Internet-based approach offers the statistical power necessary for more fine-grained evaluations and is cheaper to carry out.
Original languageEnglish
Title of host publicationProceedings of the ACL-IJCNLP 2009 Conference Short Papers
Place of PublicationStroudsburg, PA, USA
PublisherAssociation for Computational Linguistics
Pages301-304
Number of pages4
Publication statusPublished - 4 Aug 2009
EventACL-IJCNLP 2009 Conference - Suntec, Singapore
Duration: 4 Aug 20094 Aug 2009

Publication series

NameACLShort '09
PublisherAssociation for Computational Linguistics

Conference

ConferenceACL-IJCNLP 2009 Conference
Abbreviated titleACLShort '09
Country/TerritorySingapore
CitySuntec
Period4/08/094/08/09

Fingerprint

Dive into the research topics of 'Validating the Web-based Evaluation of NLG Systems'. Together they form a unique fingerprint.

Cite this