TY - GEN
T1 - Demonstration of OpenDBML, a framework for democratizing in-database machine learning
AU - Ghorbani, Mahdi
AU - Shaikhha, Amir
PY - 2023/8/1
Y1 - 2023/8/1
N2 - Machine learning over relational data has been used in several applications. The traditional approach of joining relations first and then training a model on the joined table is time-consuming and requires a significant amount of memory. Recent research has focused on in-database machine learning (in-DB ML) to address this issue; these methods train the models over relations without joining, resulting in a more efficient process. However, such systems have ad-hoc user interfaces and specific data formats, making them challenging to use. To address this problem, this paper presents OpenDBML, a framework for democratizing in-DB ML. OpenDBML offers a Python interface for multiple in-DB ML systems, a set of commonly used datasets, and the ability to add new datasets and in-DB ML systems via both Python and web interfaces. The paper also presents comprehensive demonstration scenarios to illustrate how to use OpenDBML effectively.
AB - Machine learning over relational data has been used in several applications. The traditional approach of joining relations first and then training a model on the joined table is time-consuming and requires a significant amount of memory. Recent research has focused on in-database machine learning (in-DB ML) to address this issue; these methods train the models over relations without joining, resulting in a more efficient process. However, such systems have ad-hoc user interfaces and specific data formats, making them challenging to use. To address this problem, this paper presents OpenDBML, a framework for democratizing in-DB ML. OpenDBML offers a Python interface for multiple in-DB ML systems, a set of commonly used datasets, and the ability to add new datasets and in-DB ML systems via both Python and web interfaces. The paper also presents comprehensive demonstration scenarios to illustrate how to use OpenDBML effectively.
UR - http://www.scopus.com/inward/record.url?scp=85174503084&partnerID=8YFLogxK
U2 - 10.14778/3611540.3611598
DO - 10.14778/3611540.3611598
M3 - Conference contribution
AN - SCOPUS:85174503084
VL - 16
T3 - Proceedings of the VLDB Endowment
SP - 3970
EP - 3973
BT - Proceedings of the VLDB Endowment
PB - ACM
T2 - 49th International Conference on Very Large Data Bases
Y2 - 28 August 2023 through 1 September 2023
ER -