Language-integrated provenance

Stefan Fehrenbach, James Cheney

Research output: Contribution to journalArticlepeer-review

Abstract

Provenance, or information about the origin or derivation of data, is important for assessing the trustworthiness of data and identifying and correcting mistakes. Most prior implementations of data provenance have involved heavyweight modifications to database systems and little attention has been paid to how the provenance data can be used outside such a system. We present extensions to the Links programming language that build on its support for language-integrated query to support provenance queries by rewriting and normalizing monadic comprehensions and extending the type system to distinguish provenance metadata from normal data. The main contribution of this article is to show that the two most common forms of provenance can be implemented efficiently and used safely as a programming language feature with no changes to the database system.
Original languageEnglish
Pages (from-to)103-145
Number of pages43
JournalScience of Computer Programming
Volume155
Early online date12 Sept 2017
DOIs
Publication statusPublished - 1 Apr 2018

Keywords / Materials (for Non-textual outputs)

  • cs.PL
  • cs.DB

Fingerprint

Dive into the research topics of 'Language-integrated provenance'. Together they form a unique fingerprint.

Cite this