Projects per year
Abstract
Answering queries over incomplete data is based on finding answers that are certainly true, independently of how missing values are interpreted. This informal description has given rise to several different mathematical definitions of certainty. To unify them, a framework based on ”explanations”, or extra information about incomplete data, was recently proposed. It partly succeeded in justifying query answering methods for relational databases under set semantics, but had two major limitations. First, it was firmly tied to the set data model, and a fixed way of comparing incomplete databases with respect to their information content. These assumptions fail for real-life database queries in languages such as SQL that use bag semantics instead. Second, it was restricted to queries that only manipulate data, while in practice most analytical SQL queries invent new values, typically via arithmetic operations and aggregation.
To leverage our understanding of the notion of certainty for queries for SQL-like languages, we consider incomplete databases whose information content may be enriched by additional knowledge. The knowledge order among them is derived from their semantics, rather than being fixed a priori. The resulting framework allows us to capture and justify existing notions of certainty, and extend these concepts to other data models and query languages. As natural applications, we provide for the first time a well-founded definition of certain answers for the relational bag data model, and for value-inventing queries on incomplete databases, addressing the key shortcomings of the previous approaches.
To leverage our understanding of the notion of certainty for queries for SQL-like languages, we consider incomplete databases whose information content may be enriched by additional knowledge. The knowledge order among them is derived from their semantics, rather than being fixed a priori. The resulting framework allows us to capture and justify existing notions of certainty, and extend these concepts to other data models and query languages. As natural applications, we provide for the first time a well-founded definition of certain answers for the relational bag data model, and for value-inventing queries on incomplete databases, addressing the key shortcomings of the previous approaches.
Original language | English |
---|---|
Title of host publication | Proceedings of the 17th International Conference on Principles of Knowledge Representation and Reasoning — Main Track |
Editors | Diego Calvanese, Esra Erdem, Michael Thielscher |
Publisher | International Joint Conferences on Artificial Intelligence Organization |
Pages | 758–767 |
Number of pages | 10 |
ISBN (Print) | 978-0-9992411-7-2 |
DOIs | |
Publication status | Published - 12 Sept 2020 |
Event | 17th International Conference on Principles of Knowledge Representation and Reasoning - Rhodes, Greece Duration: 12 Sept 2020 → 18 Sept 2020 https://kr2020.inf.unibz.it/ |
Publication series
Name | |
---|---|
ISSN (Print) | 2334-1033 |
Conference
Conference | 17th International Conference on Principles of Knowledge Representation and Reasoning |
---|---|
Abbreviated title | KR 2020 |
Country/Territory | Greece |
City | Rhodes |
Period | 12/09/20 → 18/09/20 |
Internet address |
Fingerprint
Dive into the research topics of 'Knowledge-preserving Certain Answers for SQL-like Queries'. Together they form a unique fingerprint.Projects
- 2 Finished
-
MAGIC: MAnaGing InComplete Data - New Foundations
Libkin, L. (Principal Investigator)
1/10/16 → 31/08/22
Project: Research
-
VADA: Value Added Data Systems: Principles and Architecture
Libkin, L. (Principal Investigator), Buneman, P. (Co-investigator), Fan, W. (Co-investigator) & Pieris, A. (Co-investigator)
1/04/15 → 30/09/20
Project: Research