F-IVM: Analytics over Relational Databases under Updates

Ahmet Kara, Milos Nikolic, Dan Olteanu, Haozhe Zhang

Research output: Contribution to journalArticlepeer-review

Abstract

This article describes F-IVM, a unified approach for maintaining analytics over changing relational data. We exemplify its versatility in four disciplines: processing queries with group-by aggregates and joins; learning linear regression models using the covariance matrix of the input features; building Chow-Liu trees using pairwise mutual information of the input features; and matrix chain multiplication. F-IVM has three main ingredients: higher-order incremental view maintenance; factorized computation; and ring abstraction. F-IVM reduces the maintenance of a task to that of a hierarchy of simple views. Such views are functions mapping keys, which are tuples of input values, to payloads, which are elements from a ring. F-IVM supports efficient factorized computation over keys, payloads, and updates. It treats uniformly seemingly disparate tasks: While in the key space, all tasks require general joins and variable marginalization, in the payload space, tasks differ in the definition of the sum and product ring operations. We implemented F-IVM on top of DBToaster and show that it can outperform classical first-order and fully recursive higher-order incremental view maintenance by orders of magnitude while using less memory.
Original languageEnglish
Pages (from-to)903–929
Number of pages27
JournalVLDB Journal
Volume33
DOIs
Publication statusPublished - 14 Nov 2023

Keywords / Materials (for Non-textual outputs)

  • incremental view maintenance
  • factorized databases
  • covariance matrix
  • learning linear regression models
  • mutual information
  • Chow-Liu tree
  • commutative ring

Fingerprint

Dive into the research topics of 'F-IVM: Analytics over Relational Databases under Updates'. Together they form a unique fingerprint.

Cite this