Abstract / Description of output
We present CerFix, a data cleaning system that finds certain fixes for tuples at the point of data entry, i.e., fixes that are guaranteed correct. It is based on master data, editing rules and certain regions. Given some attributes of an input tuple that are validated (assured correct), editing rules tell us what other attributes to fix and how to correct them with master data. A certain region is a set of attributes that, if validated, warrant a certain fix for the entire tuple. We demonstrate the following facilities provided by CerFix: (1) a region finder to identify certain regions; (2) a data monitor to find certain fixes for input tuples, by guiding users to validate a minimal number of attributes; and (3) an auditing module to show what attributes are fixed and where the correct values come from.
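The interplay of validated attributes, editing rules and master data described in the abstract can be illustrated with a minimal Python sketch. It is not CerFix's implementation: the `EditingRule` class, the `apply_rules` function, the attribute names and the sample values are all hypothetical, and the treatment of certain regions is simplified to a per-tuple check rather than the general guarantee the abstract describes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EditingRule:
    """Hypothetical rule: if the input agrees with a master tuple on the
    validated `match_on` attributes, copy the master value into `fix`."""
    match_on: tuple  # pairs (input_attr, master_attr) to compare
    fix: tuple       # pair (input_attr, master_attr): what to fix, and from where


def apply_rules(record, validated, master, rules):
    """Apply rules until no more attributes can be fixed.

    Returns the fixed record, the enlarged set of validated attributes,
    and an audit log of (attribute, old value, new value) entries.
    """
    record, validated, audit = dict(record), set(validated), []
    changed = True
    while changed:
        changed = False
        for rule in rules:
            target, source = rule.fix
            if target in validated:
                continue  # already assured correct
            if not all(attr in validated for attr, _ in rule.match_on):
                continue  # rule premises are not validated yet
            candidates = {
                m[source]
                for m in master
                if all(record[a] == m[b] for a, b in rule.match_on)
            }
            if len(candidates) == 1:  # only fix when the master data is unambiguous
                value = candidates.pop()
                audit.append((target, record.get(target), value))
                record[target] = value
                validated.add(target)
                changed = True
    return record, validated, audit


# Hypothetical master data and rules, for illustration only.
master = [{"zip": "EH8 9AB", "city": "Edinburgh", "area_code": "131"}]
rules = [
    EditingRule(match_on=(("zip", "zip"),), fix=("city", "city")),
    EditingRule(match_on=(("city", "city"),), fix=("area_code", "area_code")),
]

dirty = {"zip": "EH8 9AB", "city": "Ldn", "area_code": "020"}
fixed, validated, audit = apply_rules(dirty, {"zip"}, master, rules)

# In this simplified sense, {"zip"} behaves like a certain region for this
# tuple: validating it alone leads to every attribute being validated.
covers_whole_tuple = validated == set(dirty)
```

In this sketch the audit log plays the role of the auditing module (which attributes changed and to what), while the `covers_whole_tuple` check hints at what a region finder would have to establish in general rather than for a single tuple.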
Original language | English |
---|---|
Pages (from-to) | 1375-1378 |
Number of pages | 4 |
Journal | Proceedings of the VLDB Endowment (PVLDB) |
Volume | 4 |
Issue number | 12 |
Publication status | Published - 2011 |