Edinburgh Research Explorer

Block as a Value for SQL over NoSQL

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Open Access permissions

Open

Documents

  • Accepted author manuscript, 1 MB, PDF-document

  • Final published version, 1 MB, PDF-document

    Licence: Creative Commons: Attribution-NonCommercial-NoDerivatives (CC BY-NC-ND)

https://dl.acm.org/citation.cfm?id=3355536
Original language: English
Title of host publication: Proceedings of the VLDB Endowment
Subtitle of host publication: Proceedings of the 45th International Conference on Very Large Data Bases, Los Angeles, California
Pages: 1153-1166
Number of pages: 14
Volume: 12
Edition: 10
ISBN (Electronic): 2150-8097
DOIs
Publication status: Published - 1 Jun 2019
Event: 45th International Conference on Very Large Data Bases - Los Angeles, United States
Duration: 26 Aug 2019 to 30 Aug 2019
http://vldb.org/2019/

Conference

Conference: 45th International Conference on Very Large Data Bases
Abbreviated title: VLDB 2019
Country: United States
City: Los Angeles
Period: 26/08/19 to 30/08/19

Abstract

This paper presents Zidian, a middleware for key-value (KV) stores to speed up SQL query evaluation over NoSQL. In contrast to the common practice of taking a tuple id or primary key as the key and the entire tuple as the value, Zidian proposes a block-as-a-value model, BaaV. BaaV represents a relation as keyed blocks (k, B), where k is the key of a block (a set) B of partial tuples. We extend relational algebra to BaaV. We show that under BaaV, Zidian substantially reduces data access and communication cost. We provide characterizations (sufficient and necessary conditions) for (a) result-preserving queries, i.e., queries covered by available BaaV stores; (b) scan-free queries, i.e., queries that can be evaluated without scanning any table; and (c) bounded queries, i.e., queries that can be answered by accessing a bounded amount of data. We show that in parallel processing, Zidian guarantees (a) no scans for scan-free queries; (b) bounded communication cost for bounded queries; and (c) parallel scalability, i.e., speedup when adding processors. Moreover, Zidian can be plugged into existing SQL-over-NoSQL systems and retains horizontal scalability. Using benchmark and real-life data, we empirically verify that Zidian improves existing SQL-over-NoSQL systems by 2 orders of magnitude on average.
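To make the contrast between the two storage layouts concrete, here is a minimal Python sketch. It is not Zidian's implementation: the function names `to_tuple_store` and `to_block_store`, the toy `orders` relation, and the choice of `customer` as the block key are all assumptions made for illustration, and plain dictionaries stand in for a KV store. The sketch only shows how grouping partial tuples under a key (k, B) lets a selection on that key be answered with a single lookup instead of a scan.

```python
from collections import defaultdict


def to_tuple_store(rows, pk):
    """Common practice: primary key -> entire tuple (hypothetical helper)."""
    return {row[pk]: row for row in rows}


def to_block_store(rows, key_attrs, value_attrs):
    """Block-as-a-value layout: key k -> block B, a set of partial tuples.

    Hypothetical helper; key_attrs and value_attrs are illustrative choices.
    """
    blocks = defaultdict(set)
    for row in rows:
        k = tuple(row[a] for a in key_attrs)
        blocks[k].add(tuple(row[a] for a in value_attrs))
    return dict(blocks)


# Toy relation, invented for this example.
orders = [
    {"order_id": 1, "customer": "ann", "item": "pen", "qty": 2},
    {"order_id": 2, "customer": "bob", "item": "ink", "qty": 1},
    {"order_id": 3, "customer": "ann", "item": "pad", "qty": 5},
]

tuple_store = to_tuple_store(orders, "order_id")
block_store = to_block_store(orders, ["customer"], ["item", "qty"])

# Query: SELECT item, qty FROM orders WHERE customer = 'ann'

# Over the tuple store, every value must be inspected (a full scan).
scan_result = {
    (r["item"], r["qty"])
    for r in tuple_store.values()
    if r["customer"] == "ann"
}

# Over the block store, one get on the key returns the whole block.
get_result = block_store.get(("ann",), set())
```

In this toy setting both strategies return the same set of partial tuples, but the block lookup touches one key while the scan touches every stored tuple. Whether a given query can be answered this way depends on which keyed blocks are available, which is what the paper's characterizations of result-preserving, scan-free, and bounded queries address.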

ID: 91150306