Performance database: capturing data for optimizing distributed streaming workflows

Chee Sun Liew, Malcolm Atkinson, Radoslaw Ostrowski, Murray Cole, Jano I. van Hemert, Liangxiu Han

Research output: Contribution to journalArticlepeer-review

Abstract

The performance database (PDB) stores performance-related data gathered during workflow enactment. We argue that, by carefully understanding and manipulating these data, we can improve efficiency when enacting workflows. This paper describes the rationale behind the PDB, and proposes a systematic way to implement it. The prototype is built as part of the Advanced Data Mining and Integration Research for Europe project. We use workflows from real-world experiments to demonstrate the usage of PDB.
Original languageEnglish
Pages (from-to)3268-3284
Number of pages17
JournalPhilosophical Transactions A: Mathematical, Physical and Engineering Sciences
Volume369
Issue number1949
DOIs
Publication statusPublished - 2011

Fingerprint

Dive into the research topics of 'Performance database: capturing data for optimizing distributed streaming workflows'. Together they form a unique fingerprint.

Cite this