TY - GEN
T1 - Building the Seshat ontology for a global history databank
AU - Brennan, Rob
AU - Feeney, Kevin
AU - Mendel-Gleason, Gavin
AU - Bozic, Bojan
AU - Turchin, Peter
AU - Whitehouse, Harvey
AU - Francois, Pieter
AU - Currie, Thomas E.
AU - Grohmann, Stephanie
N1 - Funding Information:
This work was supported by a John Templeton Foundation grant, “Axial-Age Religions and the Z-Curve of Human Egalitarianism,” a Tricoastal Foundation grant, “The Deep Roots of the Modern World: The Cultural Evolution of Economic Growth and Political Stability,” an ESRC Large Grant, “Ritual, Community, and Conflict” (REF RES-060-25-0085), European Union Horizon 2020 research and innovation programme (grant agreement No 644055 [ALIGNED, www.aligned-project.eu ]) and the ADAPT Centre for Digital Content Technology, SFI Research Centres Programme (Grant 13/RC/2106) co-funded by the European Regional Development Fund. We gratefully acknowledge the contributions of our team of research assistants, post-doctoral researchers, consultants, and experts. Additionally, we have received invaluable assistance from our collaborators. Please see the Seshat website for a full list of private donors, partners, experts, and consultants and their respective areas of expertise.
Funding Information:
‘Ritual, Community, and Conflict’ research project funded by the ESRC/UK ( http://www.esrc.ac.uk/research/our-research/ritual-community-and-conflict/ ).
Publisher Copyright:
© Springer International Publishing Switzerland 2016.
PY - 2016/5/14
Y1 - 2016/5/14
N2 - This paper describes OWL ontology re-engineering from the wiki-based social science codebook (thesaurus) developed by the Seshat: Global History Databank. The ontology describes human history as a set of over 1500 time series variables and supports variable uncertainty, temporal scoping, annotations and bibliographic references. The ontology was developed to transition from traditional social science data collection and storage techniques to an RDF-based approach. RDF supports automated generation of high usability data entry and validation tools, data quality management, incorporation of facts from the web of data and management of the data curation lifecycle. This ontology re-engineering exercise identified several pitfalls in modelling social science codebooks with semantic web technologies; provided insights into the practical application of OWL to complex, real-world modelling challenges; and has enabled the construction of new, RDF-based tools to support the large-scale Seshat data curation effort. The Seshat ontology is an exemplar of a set of ontology design patterns for modelling uncertainty or temporal bounds in standard RDF. Thus the paper provides guidance for deploying RDF in the social sciences. Within Seshat, OWL-based data quality management will assure the data is suitable for statistical analysis. Publication of Seshat as high-quality, linked open data will enable other researchers to build on it.
AB - This paper describes OWL ontology re-engineering from the wiki-based social science codebook (thesaurus) developed by the Seshat: Global History Databank. The ontology describes human history as a set of over 1500 time series variables and supports variable uncertainty, temporal scoping, annotations and bibliographic references. The ontology was developed to transition from traditional social science data collection and storage techniques to an RDF-based approach. RDF supports automated generation of high usability data entry and validation tools, data quality management, incorporation of facts from the web of data and management of the data curation lifecycle. This ontology re-engineering exercise identified several pitfalls in modelling social science codebooks with semantic web technologies; provided insights into the practical application of OWL to complex, real-world modelling challenges; and has enabled the construction of new, RDF-based tools to support the large-scale Seshat data curation effort. The Seshat ontology is an exemplar of a set of ontology design patterns for modelling uncertainty or temporal bounds in standard RDF. Thus the paper provides guidance for deploying RDF in the social sciences. Within Seshat, OWL-based data quality management will assure the data is suitable for statistical analysis. Publication of Seshat as high-quality, linked open data will enable other researchers to build on it.
KW - cliodynamics
KW - ontology design patterns
KW - ontology engineering
UR - http://www.scopus.com/inward/record.url?scp=84978969323&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-34129-3_42
DO - 10.1007/978-3-319-34129-3_42
M3 - Conference contribution
AN - SCOPUS:84978969323
SN - 9783319341286
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 693
EP - 708
BT - The Semantic Web
A2 - d’Aquin, Mathieu
A2 - Sack, Harald
A2 - Lange, Christoph
A2 - Lange, Christoph
A2 - Ponzetto, Simone Paolo
A2 - Blomqvist, Eva
A2 - Ghidini, Chiara
PB - Springer
T2 - 13th International Conference on Semantic Web, ESWC 2016
Y2 - 29 May 2016 through 2 June 2016
ER -