Projects per year
Abstract / Description of output
Named entity recognition for novel domains can be challenging in the absence of suitable training materials for machine-learning or lexicons and gazetteers for term look-up. We describe an approach that starts from a small, manually created word list of commodities traded in the nineteenth century, and then uses semantic web techniques to augment the list by an order of magnitude, drawing on data stored in DBpedia. This work was conducted during the Trading Consequences project on text mining and visualisation of historical documents for the study of global trading in the British empire.
Original language | English |
---|---|
Title of host publication | Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH) |
Place of Publication | Gothenburg, Sweden |
Publisher | Association for Computational Linguistics |
Pages | 13-21 |
Number of pages | 9 |
Publication status | Published - 1 Apr 2014 |
Fingerprint
Dive into the research topics of 'Bootstrapping a historical commodities lexicon with SKOS and DBpedia'. Together they form a unique fingerprint.Projects
- 1 Finished