Data Provenance

Barbara Magagna, Doron Goldfarb, Paul Martin, Malcolm Atkinson, Spiros Koulouzis, Zhiming Zhao

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract / Description of output

The provenance of research data is of critical importance to the reproducibility of and trust in scientific results. As research infrastructures provide more amalgamated datasets for researchers and more integrated facilities for processing and publishing data, the capture of provenance in a standard, machine-actionable form becomes especially important. Significant progress has already been made in providing standards and tools for provenance tracking, but the integration of these technologies into research infrastructure remains limited in many scientific domains. Further development and collaboration are required to provide frameworks for provenance capture that can be adopted by as widely as possible, facilitating interoperability as well as dataset reuse. In this chapter, we examine the current state of the art for provenance, and the current state of provenance capture in environmental and Earth science research infrastructures in Europe, as surveyed in the course of the ENVRIplus project. We describe a service developed for the upload, dissemination and application of provenance templates that can be used to generate standardised provenance traces from input data in accordance with current best practice and standards. The use of such a service by research infrastructure architects and researchers can expedite both the understanding and use of provenance technologies, and so drive the standard use of provenance capture technologies in future research infrastructure developments.
Original languageEnglish
Title of host publicationTowards Interoperable Research Infrastructures for Environmental and Earth Sciences: A Reference Model Guided Approach for Common Challenges
EditorsZhiming Zhao, Margareta Hellström
Place of PublicationCham
PublisherSpringer International Publishing
Number of pages18
ISBN (Electronic)978-3-030-52829-4
ISBN (Print)978-3-030-52828-7
Publication statusPublished - 25 Jul 2020

Publication series

NameLecture Notes in Computer Science
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349
NameInformation Systems and Applications, incl. Internet/Web, and HCI

Keywords / Materials (for Non-textual outputs)

  • Provenance
  • Scientific workflow management
  • Research data


Dive into the research topics of 'Data Provenance'. Together they form a unique fingerprint.

Cite this