Edinburgh Research Explorer

Semandaq: a data quality system based on conditional functional dependencies

Research output: Contribution to journal › Article

Original language: English
Pages (from-to): 1460-1463
Number of pages: 4
Journal: Proceedings of the VLDB Endowment (PVLDB)
Volume: 1
Issue number: 2
Publication status: Published - 2008

Abstract

We present SEMANDAQ, a prototype system for improving the quality of relational data. Based on the recently proposed conditional functional dependencies (CFDs), it detects and repairs errors and inconsistencies that emerge as violations of these constraints. We demonstrate the following functionalities supported by SEMANDAQ: (a) an interface for specifying CFDs; (b) a visual tool for automated detection of CFD violations in relational data, leveraging efficient SQL-based techniques; (c) extensive visual data exploration capabilities that provide the user with various measures of the quality of the data; (d) repair (cleaning) functionality without excess human interaction, built upon CFD-based cleaning algorithms; we show how SEMANDAQ allows for a natural exploration of the quality of the obtained repairs. SEMANDAQ is a promising tool that provides easy access and user-friendly data quality facilities for any relational database system.
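
To make the idea of SQL-based detection of CFD violations concrete, the following is a minimal sketch and not SEMANDAQ's actual implementation. It assumes a hypothetical `cust` table and two illustrative CFDs that do not come from the abstract: a constant CFD ([cc = '44', zip = 'EH8 9AB'] -> [city = 'Edinburgh']), which a single tuple can violate, and a variable CFD ([cc = '44', zip] -> [street]), which is violated by groups of tuples that agree on `zip` but disagree on `street`. The function names, schema, and sample rows are all assumptions made for the example.

```python
import sqlite3

# Hypothetical customer table; schema and rows are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE cust (id INTEGER PRIMARY KEY, cc TEXT, zip TEXT, street TEXT, city TEXT)"
)
conn.executemany(
    "INSERT INTO cust (id, cc, zip, street, city) VALUES (?, ?, ?, ?, ?)",
    [
        (1, "44", "EH8 9AB", "Crichton St", "Edinburgh"),
        (2, "44", "EH8 9AB", "Mayfield Rd", "Edinburgh"),  # same UK zip, different street
        (3, "01", "07974", "Mountain Ave", "Murray Hill"),
        (4, "01", "07974", "Main St", "Murray Hill"),      # cc != '44', so the CFDs do not apply
        (5, "44", "EH8 9AB", "Crichton St", "Glasgow"),    # wrong city for this zip
    ],
)


def constant_cfd_violations(conn):
    """Single-tuple check for [cc='44', zip='EH8 9AB'] -> [city='Edinburgh'].

    A tuple violates the CFD if it matches the left-hand-side pattern
    but carries a different constant on the right-hand side.
    """
    rows = conn.execute(
        "SELECT id FROM cust "
        "WHERE cc = '44' AND zip = 'EH8 9AB' AND city <> 'Edinburgh' "
        "ORDER BY id"
    )
    return [r[0] for r in rows]


def variable_cfd_violations(conn):
    """Multi-tuple check for [cc='44', zip] -> [street].

    Among tuples with cc = '44', any zip associated with more than one
    distinct street marks every tuple in that group as a violation.
    """
    rows = conn.execute(
        "SELECT id FROM cust "
        "WHERE cc = '44' AND zip IN ("
        "  SELECT zip FROM cust WHERE cc = '44' "
        "  GROUP BY zip HAVING COUNT(DISTINCT street) > 1"
        ") ORDER BY id"
    )
    return [r[0] for r in rows]


print(constant_cfd_violations(conn))
print(variable_cfd_violations(conn))
```

In this sketch the pattern constants are hard-coded into the queries for readability. A system handling many CFDs would more plausibly store the patterns in a table and join against it, but that design is an assumption here, not something the abstract states.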

ID: 17663221