Avoiding the drunkard's search: Investigating collection strategies for building a Twitter dataset

Clare Llewellyn, Laura Cram, Adrian Favero

Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)peer-review

Abstract

We investigate methods for collecting data to form an archive on the debate within Twitter surrounding the UK's inclusion in the EU. We use three strategies, gathering data using hashtags, extracting data from the random stream and collecting from users known to be discussing the debate. We explore the various bias in the resulting datasets.
Original languageEnglish
Title of host publicationJCDL '16 Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries
Place of PublicationNew York
PublisherACM
Pages205-206
ISBN (Print)9781450342292
DOIs
Publication statusPublished - 19 Jun 2016

Fingerprint

Dive into the research topics of 'Avoiding the drunkard's search: Investigating collection strategies for building a Twitter dataset'. Together they form a unique fingerprint.

Cite this