TY - JOUR
T1 - Mid-price prediction based on machine learning methods with technical and quantitative indicators
AU - Ntakaris, Adamantios
AU - Kanniainen, Juho
AU - Gabbouj, Moncef
AU - Iosifidis, Alexandros
N1 - Publisher Copyright:
Copyright: © 2020 Ntakaris et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2020/6/12
Y1 - 2020/6/12
N2 - Stock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement prediction for Nordic TotalView-ITCH stocks. The suggested feature list represents one of the most extensive studies in the field of financial feature engineering. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. The proposed feature is consistently selected as the first feature among a large number of indicators used in this study. We further examine the best combinations of features using a high-frequency limit order book Nordic database. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best classification performance with a combination of only a few advanced hand-crafted features.
AB - Stock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement prediction for Nordic TotalView-ITCH stocks. The suggested feature list represents one of the most extensive studies in the field of financial feature engineering. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. The proposed feature is consistently selected as the first feature among a large number of indicators used in this study. We further examine the best combinations of features using a high-frequency limit order book Nordic database. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best classification performance with a combination of only a few advanced hand-crafted features.
UR - http://www.scopus.com/inward/record.url?scp=85086523968&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0234107
DO - 10.1371/journal.pone.0234107
M3 - Article
C2 - 32530920
AN - SCOPUS:85086523968
VL - 15
JO - PLoS ONE
JF - PLoS ONE
SN - 1932-6203
IS - 6
M1 - e0234107
ER -