Data Provenance

Barbara Magagna, Doron Goldfarb, Paul Martin, Malcolm Atkinson, Spiros Koulouzis, Zhiming Zhao

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

The provenance of research data is of critical importance to the reproducibility of and trust in scientific results. As research infrastructures provide more amalgamated datasets for researchers and more integrated facilities for processing and publishing data, the capture of provenance in a standard, machine-actionable form becomes especially important. Significant progress has already been made in providing standards and tools for provenance tracking, but the integration of these technologies into research infrastructure remains limited in many scientific domains. Further development and collaboration are required to provide frameworks for provenance capture that can be adopted by as widely as possible, facilitating interoperability as well as dataset reuse. In this chapter, we examine the current state of the art for provenance, and the current state of provenance capture in environmental and Earth science research infrastructures in Europe, as surveyed in the course of the ENVRIplus project. We describe a service developed for the upload, dissemination and application of provenance templates that can be used to generate standardised provenance traces from input data in accordance with current best practice and standards. The use of such a service by research infrastructure architects and researchers can expedite both the understanding and use of provenance technologies, and so drive the standard use of provenance capture technologies in future research infrastructure developments.
Original languageEnglish
Title of host publicationTowards Interoperable Research Infrastructures for Environmental and Earth Sciences: A Reference Model Guided Approach for Common Challenges
EditorsZhiming Zhao, Margareta Hellström
Place of PublicationCham
PublisherSpringer
Pages208-225
Number of pages18
Edition1
ISBN (Electronic)978-3-030-52829-4
ISBN (Print)978-3-030-52828-7
DOIs
Publication statusPublished - 25 Jul 2020

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume12003
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349
NameInformation Systems and Applications, incl. Internet/Web, and HCI
PublisherSpringer
Volume12003

Keywords / Materials (for Non-textual outputs)

  • Provenance
  • Scientific workflow management
  • Research data

Fingerprint

Dive into the research topics of 'Data Provenance'. Together they form a unique fingerprint.

Cite this