Exploring Structured Documents and Query Formulation Techniques for Patent Retrieval

Walid Magdy, Johannes Leveling, Gareth J. F. Jones

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

This paper presents the experiments and results of DCU in CLEF-IP 2009. Our work applied standard information retrieval (IR) techniques to patent search. Different experiments tested various methods for the patent retrieval, including query formulation, structured index, weighted fields, document filtering, and blind relevance feedback. Some methods did not show expected good retrieval effectiveness such as blind relevance feedback, other experiments showed acceptable performance. Query formulation was the key to achieving better retrieval effectiveness, and this was performed through assigning higher weights to certain document fields. Further experiments showed that for longer queries, better results are achieved but at the expense of additional computations. For the best runs, the retrieval effectiveness is still lower than for IR applications for other domains, illustrating the difficulty of patent search. The official results have shown that among fifteen participants we achieved the seventh and the fourth ranks from the mean average precision (MAP) and recall point of view, respectively.

Download to read the full conference paper text
Original languageEnglish
Title of host publicationMultilingual Information Access Evaluation I. Text Retrieval Experiments, 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, September 30 - October 2, 2009, Revised Selected Papers
PublisherSpringer Berlin Heidelberg
Pages410-417
Number of pages8
ISBN (Electronic)978-3-642-15754-7
ISBN (Print)978-3-642-15753-0
DOIs
Publication statusPublished - 2009

Publication series

NameLecture Notes in Computer Science (LNCS)
Publisher Springer Berlin Heidelberg
Volume6241
ISSN (Print)0302-9743

Fingerprint

Dive into the research topics of 'Exploring Structured Documents and Query Formulation Techniques for Patent Retrieval'. Together they form a unique fingerprint.

Cite this