Generating corrupted data sources for the evaluation of matching systems

Fiona McNeill, Diana Bental, Alasdair J. G. Gray, Sabina Jedrzejczyk, Ahmad Alsadeeqi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


One of the most difficult aspects of developing matching systems, whether for matching ontologies or for other types of mismatched data, is evaluation. The accuracy of matchers is usually evaluated by measuring the results produced by the systems against reference sets, but gold-standard reference sets are expensive and difficult to create. In this paper we introduce crptr, which generates multiple variations of different sorts of dataset, where the degree of variation is controlled, so that they can be used to evaluate matchers in different contexts.
Original language: English
Title of host publication: The Fourteenth International Workshop on Ontology Matching
Publisher: CEUR Workshop Proceedings
Number of pages: 4
Publication status: Published - 16 Jan 2020
Event: 14th International Workshop on Ontology Matching - Owen G. Glenn Building, The University of Auckland, Auckland, New Zealand
Duration: 26 Oct 2019 to 26 Oct 2019

Publication series

Name: CEUR Workshop Proceedings
ISSN (Electronic): 1613-0073


Workshop: 14th International Workshop on Ontology Matching
Abbreviated title: OM 2019
Country: New Zealand


  • Matching
  • Evaluation
  • Data Corruption
