On the Data Complexity of Relative Information Completeness

Yang Cao, Ting Deng*, Wenfei Fan, Floris Geerts

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

Databases in an enterprise are often partially closed: parts of their data must be contained in master data, which has complete information about the core business entities of the enterprise. With this comes the need for studying relative information completeness: a partially closed database is said to be complete for a query relative to master data if it has complete information to answer the query, i.e., extending the database by adding more tuples either does not change its answer to the query or makes it no longer partially closed w.r.t. the master data. This paper investigates three problems associated with relative information completeness. Given a query Q and a partially closed database D master data D-m, (1) the relative completeness problem is to decide whether D is complete for Q relative to D-m; (2) the minimal completeness problem is to determine whether D is a minimal database that is complete for Q relative to D-m; and (3) the bounded extension problem is to decide whether it suffices to extend D by adding at most K tuples, such that the extension makes a partially closed database that is complete for Q relative to D-m. While the combined complexity bounds of the relative completeness problem and the minimal completeness problem are already known, neither their data complexity nor the bounded extension problem has been studied. We establish upper and lower bounds of these problems for data complexity, all matching, for Q expressed in a variety of query languages. (C) 2014 Elsevier Ltd. All rights reserved.

Original languageEnglish
Pages (from-to)18-34
Number of pages17
JournalInformation Systems
Volume45
DOIs
Publication statusPublished - Sept 2014

Keywords / Materials (for Non-textual outputs)

  • Incomplete information
  • Relative completeness
  • Master data management
  • Partially closed databases
  • Data complexity

Fingerprint

Dive into the research topics of 'On the Data Complexity of Relative Information Completeness'. Together they form a unique fingerprint.

Cite this