Abstract / Description of output
Twitter-related studies often need to geo-locate Tweets or Twitter users, identifying their real-world geographic locations. As tweet-level geotagging remains rare, most prior work exploited tweet content, timezone and network information to inform geolocation, or else relied on off-the-shelf tools to geolocate users from location information in their user profiles. However, such user location metadata is not consistently structured, causing such tools to fail regularly, especially if a string contains multiple locations, or if locations are very fine-grained. We argue that user profile location (UPL) and tweet location need to be treated as distinct types of information from which differing inferences can be drawn. Here, we apply geoparsing to UPLs, and demonstrate how task performance can be improved by adapting our Edinburgh Geoparser, which was originally developed for processing English text. We present a detailed evaluation method and results, including inter-coder agreement. We demonstrate that the optimised geoparser can effectively extract and geo-reference multiple locations at different levels of granularity with an F1-score of around 0.90. We also illustrate how geoparsed UPLs can be exploited for international information trade studies and country-level sentiment analysis.
Original language | English |
---|---|
Title of host publication | Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) |
Publisher | European Language Resources Association (ELRA) |
Pages | 3936-3944 |
Number of pages | 9 |
ISBN (Print) | 978-2-9517408-9-1 |
Publication status | Published - May 2016 |
Event | 10th edition of the Language Resources and Evaluation Conference - Portorož , Slovenia Duration: 23 May 2016 → 28 May 2016 http://lrec2016.lrec-conf.org/en/ http://www.lrec-conf.org/proceedings/lrec2016/index.html |
Conference
Conference | 10th edition of the Language Resources and Evaluation Conference |
---|---|
Abbreviated title | LREC 2016 |
Country/Territory | Slovenia |
City | Portorož |
Period | 23/05/16 → 28/05/16 |
Internet address |