The added value of Facebook friends data in event attendance prediction

Matthias Bogaert, Michel Ballings*, Dirk Van Den Poel

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This paper seeks to assess the added value of a Facebook user's friends data in event attendance prediction over and above user data. For this purpose we gathered data of users that have liked an anonymous European soccer team on Facebook. In addition we obtained data from all their friends. In order to assess the added value of friends data we have built two models for five different algorithms (Logistic Regression, Random Forest, Adaboost, Neural Networks and Naive Bayes). The baseline model contained only user data and the augmented model contained both user and friends data. We employed five times two-fold cross-validation and the Wilcoxon signed rank test to validate our findings. The results suggest that the inclusion of friends data in our predictive model increases the area under the receiver operating characteristic curve (AUC). Out of five algorithms, the increase is significant for three algorithms, marginally significant for one algorithm, and not significant for one algorithm. The increase in AUC ranged from 0.21%-points to 0.82%-points. The analyses show that a top predictor is the number of friends that are attending the focal event. To the best of our knowledge this is the first study that evaluates the added value of friends network data over and above user data in event attendance prediction on Facebook. These findings clearly indicate that including network data in event prediction models is a viable strategy for improving model performance.

Original languageEnglish
Pages (from-to)26-34
Number of pages8
JournalDecision Support Systems
Volume82
DOIs
Publication statusPublished - 1 Feb 2016

Keywords

  • events
  • Facebook
  • network data
  • predictive models
  • social media

Fingerprint

Dive into the research topics of 'The added value of Facebook friends data in event attendance prediction'. Together they form a unique fingerprint.

Cite this