Activities per year
Abstract / Description of output
This paper presents the new facilities provided in defoe, a parallel toolbox for querying a wealth of digitised newspapers and books at scale. defoe has been extended to work with further Natural Language Processing () tools such as the Edinburgh Geoparser, to store the preprocessed text in several storage facilities and to support different types of queries and analyses. We have also extended the collection of XML schemas supported by defoe, increasing the versatility of the tool for the analysis of digital historical textual data at scale. Finally, we have conducted several studies in which we worked with humanities and social science researchers who posed complex and interested questions to large-scale digital collections. Results shows that defoe allows researchers to conduct their studies and obtain results faster, while all the large-scale text mining complexity
is automatically handled by defoe.
is automatically handled by defoe.
Original language | English |
---|---|
Title of host publication | 2021 IEEE 17th International Conference on eScience (eScience) |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 21-29 |
Number of pages | 9 |
ISBN (Electronic) | 9781665403610 |
ISBN (Print) | 9781665447089 |
DOIs | |
Publication status | Published - 26 Oct 2021 |
Event | IEEE eScience 2021 - 17th IEEE eScience 2021 International Conference - University of Innsbruck , Innsbruck, Austria Duration: 20 Sept 2021 → 23 Sept 2021 https://www.escience2021.org |
Conference
Conference | IEEE eScience 2021 - 17th IEEE eScience 2021 International Conference |
---|---|
Abbreviated title | eScience 2021 |
Country/Territory | Austria |
City | Innsbruck |
Period | 20/09/21 → 23/09/21 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- text mining
- distributed queries
- High Performance Computing
- XML schemas
- digital tools
- humanities research
Fingerprint
Dive into the research topics of 'Extending defoe for the efficient analysis of historical texts at scale'. Together they form a unique fingerprint.Activities
- 1 Oral presentation
-
'Scots for the masses'? Exploring the use of Scots in 19th century chapbooks
Sarah Van Eyndhoven (Speaker), Lisa Gotthard (Speaker) & Rosa Filgueira (Contributor)
Jun 2021Activity: Academic talk or presentation types › Oral presentation