Sequence modelling for sentence classification in a legal summarisation system

Ben Hachey, Claire Grover

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

We describe a set of experiments using a wide range of machine learning techniques for the task of predicting the rhetorical status of sentences. The research is part of a text summarisation project for the legal domain for which we use a new corpus of judgments of the UK House of Lords. We present experimental results for classification according to a rhetorical scheme indicating a sentence's contribution to the overall argumentative structure of the legal judgments using four learning algorithms from the Weka package (C4.5, naïve Bayes, Winnow and SVMs). We also report results using maximum entropy models both in a standard classification framework and in a sequence labelling framework. The SVM classifier and the maximum entropy sequence tagger yield the most promising results.
Original languageEnglish
Title of host publicationProceedings of the 2005 ACM Symposium on Applied Computing (SAC), Santa Fe, New Mexico, USA, March 13-17, 2005
Number of pages5
ISBN (Print)1-58113-964-0
Publication statusPublished - 2005


Dive into the research topics of 'Sequence modelling for sentence classification in a legal summarisation system'. Together they form a unique fingerprint.

Cite this