Experiences of Using WDumper to Create Topical Subsets from Wikidata

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution


Wikidata is a general-purpose knowledge graph covering a wide variety of topics with content being crowd-sourced through an open wiki. There are now over 90M interrelated data items in Wikidata which are accessible through a public query endpoint and data dumps. However, execution timeout limits and the size of data dumps make it difficult to use the data. The creation of arbitrary topical subsets of Wikidata, where only the relevant data is kept, would enable reuse of that data with the benefits of cost reduction, ease of access, and flexibility. In this paper, we provide a working definition for topical subsets over the Wikidata Knowledge Graph and evaluate a third-party tool (WDumper) to extract these topical subsets from Wikidata.
Original language: English
Title of host publication: Proceedings of the 2nd International Workshop on Knowledge Graph Construction co-located with 18th Extended Semantic Web Conference (ESWC 2021)
Editors: D. Chaves-Fraga, A. Dimou, P. Heyvaert, F. Priyatna, J. Sequeda
Publication status: Published - 2 Jun 2021
Event: Second International Workshop On Knowledge Graph Construction: Co-located with the ESWC 2021 - Hersonissos, Greece
Duration: 6 Jun 2021 → …
Conference number: 2

Publication series

Name: CEUR Workshop Proceedings
ISSN (Electronic): 1613-0073


Workshop: Second International Workshop On Knowledge Graph Construction
Abbreviated title: KGCW 2021
Period: 6/06/21 → …
Other: Online and in-person event


  • knowledge graph subsetting
  • topical subset
  • WDumper
  • Wikidata

