The benefits of service choreography for data-intensive computing

Adam Barker*, Paolo Besana, David Robertson, Jon B. Weissman

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

As the number of services and the size of data involved in workflows increases, centralised orchestration techniques are reaching the limits of scalability. In the classic orchestration model, all data pass through a centralised engine, which results in unnecessary data transfer, wasted bandwidth and the engine to become a bottleneck to the execution of a workflow. Choreography techniques, although more complex to model offer a decentralised alternative and are the optimal architecture for data-centric workflows; data are passed directly to where they are required, at the next service in the workflow. While orchestration is the dominant architectural approach, there are relatively few choreography languages and even fewer concrete implementations. This papers contributions are twofold. Firstly we argue the case for choreography in data-intensive computing, and demonstrate through workflow patterns the advantages in terms of scalability when a choreography architecture is adopted. Secondly we introduce the Light Weight Coordination Calculus (LCC), a type of process calculus used to formally define choreographies, and the OpenKnowledge framework, a choreography-based architecture, providing the functionality for peers to coordinate in an open peer-to-peer system. Through LCC and the OpenKnowledge framework we practically demonstrate how choreography can be achieved in a lightweight manner with a comparatively simple process language.
Original languageEnglish
Title of host publicationProceedings of the 7th international workshop on Challenges of large applications in distributed environments
PublisherACM
Pages1-10
Number of pages10
ISBN (Print)9781605585888
DOIs
Publication statusPublished - 2009
Event7th International Workshop on Challenges of Large Applications in Distributed Environments, CLADE'09, Co-located with the 2009 International Symposium on High Performance Distributed Computing Conference - Garching, Germany
Duration: 9 Jun 200910 Jun 2009

Conference

Conference7th International Workshop on Challenges of Large Applications in Distributed Environments, CLADE'09, Co-located with the 2009 International Symposium on High Performance Distributed Computing Conference
Abbreviated titleHPDC'09
Country/TerritoryGermany
CityGarching
Period9/06/0910/06/09

Keywords / Materials (for Non-textual outputs)

  • algorithms
  • design

Fingerprint

Dive into the research topics of 'The benefits of service choreography for data-intensive computing'. Together they form a unique fingerprint.

Cite this