Abstract / Description of output
Forecasting methods are routinely employed to predict the outcome of competitive events (CEs) and to shed light on the factors that influence participants' winning prospects (e.g.; in sports events, political elections). Combining statistical models' forecasts, shown to be highly successful in other settings, has been neglected in CE prediction. Two particular difficulties arise when developing model-based composite forecasts of CE outcomes: the intensity of rivalry among contestants, and the strength/diversity trade-off among individual models. To overcome these challenges we propose a range of surrogate measures of event outcome to construct a heterogeneous set of base forecasts. To effectively extract the complementary information concealed within these predictions, we develop a novel pooling mechanism which accounts for competition among contestants: a stacking paradigm integrating conditional logit regression and log-likelihood-ratio-based forecast selection. Empirical results using data related to horseracing events demonstrate that: (i) base model strength and diversity are important when combining model-based predictions for CEs; (ii) average-based pooling, commonly employed elsewhere, may not be appropriate for CEs (because average-based pooling exclusively focuses on strength); and (iii) the proposed stacking ensemble provides statistically and economically accurate forecasts. These results have important implications for regulators of betting markets associated with CEs and in particular for the accurate assessment of market efficiency.
Original language | English |
---|---|
Pages (from-to) | 163-174 |
Number of pages | 12 |
Journal | European Journal of Operational Research |
Volume | 218 |
Issue number | 1 |
Early online date | 6 Nov 2011 |
DOIs | |
Publication status | Published - 1 Apr 2012 |
Keywords / Materials (for Non-textual outputs)
- Competitive event prediction
- Forecast combination
- Forecasting