Abstract
Wikidata is a general-purpose knowledge graph covering a wide variety of topics with content being crowd-sourced through an open wiki. There are now over 90M interrelated data items in Wikidata which are accessible through a public query endpoint and data dumps. However, execution timeout limits and the size of data dumps make it difficult to use the data. The creation of arbitrary topical subsets of Wikidata, where only the relevant data is kept, would enable reuse of that data with the benefits of cost reduction, ease of access, and flexibility. In this paper, we provide a working definition for topical subsets over the Wikidata Knowledge Graph and evaluate a third-party tool (WDumper) to extract these topical subsets from Wikidata.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2nd International Workshop on Knowledge Graph Construction co-located with 18th Extended Semantic Web Conference (ESWC 2021) |
Editors | D. Chaves-Fraga, A. Dimou, P. Heyvaert, F. Priyatna, J. Sequeda |
Publisher | CEUR-WS |
Pages | 1-15 |
Volume | 2873 |
Publication status | Published - 2 Jun 2021 |
Event | Second International Workshop On Knowledge Graph Construction: Co-located with the ESWC 2021 - Hersonissos, Greece Duration: 6 Jun 2021 → … Conference number: 2 https://kg-construct.github.io/workshop/2021/ |
Publication series
Name | CEUR Workshop Proceedings |
---|---|
Publisher | CEUR-WS |
ISSN (Electronic) | 1613-0073 |
Workshop
Workshop | Second International Workshop On Knowledge Graph Construction |
---|---|
Abbreviated title | KGCW 2021 |
Country/Territory | Greece |
City | Hersonissos |
Period | 6/06/21 → … |
Other | Online and in-person event |
Internet address |
Keywords
- knowledge graph subsetting
- topical subset
- Wdumper
- Wikidata