Modelling Compression with Discourse Constraints

James Clarke, Mirella Lapata

Research output: Chapter in Book/Report/Conference proceedingConference contribution


Sentence compression holds promise for many applications ranging from summarisation to subtitle generation and subtitle generation. The task is typically performed on isolated sentences without taking the surrounding context into account, even though most applications would operate over entire documents. In this paper we present a discourse informed model which is capable of producing document compressions that are coherent and informative. Our model is inspired by theories of local coherence and formulated within the framework of Integer
Linear Programming. Experimental results show significant improvements over a state-of-the-art discourse agnostic approach.
Original languageEnglish
Title of host publicationProceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
PublisherAssociation for Computational Linguistics
Number of pages11
Publication statusPublished - 2007

Cite this