Distributed processing of elevation data by means of apache hadoop in a small cluster

Jitka Komarkova, Jakub Spidlen, Devanjan Bhattacharya, Oldrich Horak

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Geoinformation technologies require fast processing of high and quickly increasing volumes of all types of spatial data. Parallel computational approach and distributed systems represent technologies which are able to provide required services, with reasonable costs. MapReduce is one example of such approach. It has been successfully implemented in large clusters in several instances. The applications include spatial and imagery data processing. The contribution deals with its implementation and operational performance using only a very small cluster (consisting of a few commodity personal computers) to process large-volume spatial data. Open-source implementation of MapReduce, named, Apache Hadoop, is used. The contribution is focused on a low-price solution and it deals with speed of processing and distribution of processed files. Authors run several experiments to evaluate the benefit of distributed data processing in a small-sized cluster and to find possible limitations. Size of processed files and number of processed values is used as the most important criteria for performance evaluation. Point elevation data were used during the experiments.

Original languageEnglish
Title of host publicationICSOFT 2013 - Proceedings of the 8th International Joint Conference on Software Technologies
Pages340-344
Number of pages5
Publication statusPublished - 2013
Event8th International Joint conference on Software Technologies, ICSOFT 2013 - Reykjavik, Iceland
Duration: 29 Jul 201331 Jul 2013

Publication series

NameICSOFT 2013 - Proceedings of the 8th International Joint Conference on Software Technologies

Conference

Conference8th International Joint conference on Software Technologies, ICSOFT 2013
Country/TerritoryIceland
CityReykjavik
Period29/07/1331/07/13

Keywords / Materials (for Non-textual outputs)

  • Apache hadoop
  • distributed processing
  • elevation data
  • small cluster

Fingerprint

Dive into the research topics of 'Distributed processing of elevation data by means of apache hadoop in a small cluster'. Together they form a unique fingerprint.

Cite this