Projects per year
Abstract
We present dispel4py a versatile data-intensive
kit presented as a standard Python library. It empowers
scientists to experiment and test ideas using their familiar
rapid-prototyping environment. It delivers mappings to diverse
computing infrastructures, including cloud technologies, HPC
architectures and specialised data-intensive machines, to move
seamlessly into production with large-scale data loads. The
mappings are fully automated, so that the encoded data
analyses and data handling are completely unchanged. The
underpinning model is lightweight composition of fine-grained
operations on data, coupled together by data streams that
use the lowest cost technology available. These fine-grained
workflows are locally interpreted during development and
mapped to multiple nodes and systems such as MPI and Storm
for production.
We explain why such an approach is becoming more essential
in order that data-driven research can innovate rapidly and
exploit the growing wealth of data while adapting to current
technical trends. We show how provenance management is
provided to improve understanding and reproducibility, and
how a registry supports consistency and sharing. Three application
domains are reported and measurements on multiple
infrastructures show the optimisations achieved. Finally we
present the next steps to achieve scalability and performance.
Original language | English |
---|---|
Title of host publication | Proceedings of 11th IEEE eScience 2015, Munich, Germany, September 1-4 |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 454 - 464 |
Number of pages | 11 |
DOIs | |
Publication status | Published - 30 Sept 2015 |
Fingerprint
Dive into the research topics of 'dispel4py: An User-friendly Framework for Describing eScience Applications'. Together they form a unique fingerprint.Projects
- 3 Finished
-
The Terra-correlator: A computing facility for massive re
Main, I. (Principal Investigator)
14/11/13 → 14/10/15
Project: Research
-
Virtual Earthquake and seismology Research Community in Europe e-science environment (VERCE)
Atkinson, M. (Principal Investigator) & Parsons, M. (Co-investigator)
1/10/11 → 30/09/15
Project: Research
-
NRP: National eScience Centre Research Platform
Robertson, D. (Principal Investigator)
1/09/08 → 28/02/14
Project: Research