Discourse Constraints for Document Compression

James Clarke, Mirella Lapata

Research output: Contribution to journal (article, peer-reviewed)


Sentence compression holds promise for many applications, ranging from summarization to subtitle generation. The task is typically performed on isolated sentences without taking the surrounding context into account, even though most applications would operate over entire documents. In this article, we present a discourse-informed model that is capable of producing document compressions that are coherent and informative. Our model is inspired by theories of local coherence and formulated within the framework of integer linear programming. Experimental results show significant improvements over a state-of-the-art discourse-agnostic approach.
Original language: English
Pages (from-to): 411-441
Number of pages: 31
Journal: Computational Linguistics
Issue number: 3
Publication status: Published - Sep 2010
