Abstract
Graphs encode important structural properties of complex systems. Machine learning on graphs has therefore emerged as an important technique in research and applications. We present Karate Club - a Python framework combining more than 30 state-of-the-art graph mining algorithms. These unsupervised techniques make it easy to identify and represent common graph features. The primary goal of the package is to make community detection, node and whole graph embedding available to a wide audience of machine learning researchers and practitioners. Karate Club is designed with an emphasis on a consistent application interface, scalability, ease of use, sensible out of the box model behaviour, standardized dataset ingestion, and output generation. This paper discusses the design principles behind the framework with practical examples. We show Karate Club's efficiency in learning performance on a wide range of real world clustering problems and classification tasks along with supporting evidence of its competitive speed.
Original language | English |
---|---|
Title of host publication | Proceedings of the 29th ACM International Conference on Information & Knowledge Management |
Place of Publication | New York, NY, USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 3125–3132 |
ISBN (Print) | 9781450368599 |
DOIs | |
Publication status | Published - 19 Oct 2020 |
Event | 29th ACM International Conference on Information and Knowledge Management - Omline Conference Duration: 19 Oct 2020 → 23 Oct 2020 https://www.cikm2020.org/index.html |
Conference
Conference | 29th ACM International Conference on Information and Knowledge Management |
---|---|
Abbreviated title | CIKM 2020 |
City | Omline Conference |
Period | 19/10/20 → 23/10/20 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- graph mining
- network embedding
- community detection
- graph embedding
- node embedding
- graph classification
- machine learning