Demonstration of OpenDBML, a framework for democratizing in-database machine learning

Mahdi Ghorbani, Amir Shaikhha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Machine learning over relational data has been used in several applications. The traditional approach of joining relations first and then training a model on the joined table is time-consuming and requires a significant amount of memory. Recent research has focused on in-database machine learning (in-DB ML) to address this issue; these methods train the models over relations without joining, resulting in a more efficient process. However, such systems have ad-hoc user interfaces and specific data formats, making them challenging to use. To address this problem, this paper presents OpenDBML, a framework for democratizing in-DB ML. OpenDBML offers a Python interface for multiple in-DB ML systems, a set of commonly used datasets, and the ability to add new datasets and in-DB ML systems via both Python and web interfaces. The paper also presents comprehensive demonstration scenarios to illustrate how to use OpenDBML effectively.
Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
PublisherACM
Pages3970-3973
Number of pages4
Volume16
Edition12
DOIs
Publication statusPublished - 1 Aug 2023
Event49th International Conference on Very Large Data Bases - Vancouver, Canada
Duration: 28 Aug 20231 Sept 2023

Publication series

NameProceedings of the VLDB Endowment
PublisherACM
ISSN (Print)2150-8097

Conference

Conference49th International Conference on Very Large Data Bases
Abbreviated titleVLDB 2023
Country/TerritoryCanada
CityVancouver
Period28/08/231/09/23

Fingerprint

Dive into the research topics of 'Demonstration of OpenDBML, a framework for democratizing in-database machine learning'. Together they form a unique fingerprint.

Cite this