Abstract / Description of output
This paper develops a method for handling two-class classification problems with highly unbalanced class sizes and asymmetric misclassification costs. When the class sizes are highly unbalanced and the minority class represents a rare event, conventional classification methods tend to strongly favour the majority class, resulting in a very low detection rate for the minority class. A method is proposed to determine the optimal cut-off when misclassification costs are asymmetric and class sizes are unbalanced. Monte Carlo simulations show that the proposed method performs better than the method based on classification accuracy. Finally, the proposed method is applied to empirical data on Italian small and medium enterprises to classify them into default and non-default groups.
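To make the idea of a cost-sensitive cut-off concrete, the following is a minimal sketch and not the paper's procedure, which the abstract does not spell out. It assumes a generic rule: among the observed scores, pick the cut-off that minimises the average misclassification cost on a labelled sample. The function names, the cost values, and the toy data (label 1 standing for the rare default class) are all hypothetical.

```python
def expected_cost(scores, labels, cutoff, cost_fn, cost_fp):
    """Average misclassification cost when predicting class 1 for score >= cutoff."""
    total = 0.0
    for score, label in zip(scores, labels):
        predicted = 1 if score >= cutoff else 0
        if label == 1 and predicted == 0:
            total += cost_fn  # missed minority case (false negative)
        elif label == 0 and predicted == 1:
            total += cost_fp  # false alarm (false positive)
    return total / len(scores)


def best_cutoff(scores, labels, cost_fn, cost_fp):
    """Return the candidate cut-off with the lowest average cost.

    Candidates are the distinct observed scores plus +inf ("never predict 1").
    """
    candidates = sorted(set(scores)) + [float("inf")]
    return min(
        candidates,
        key=lambda c: expected_cost(scores, labels, c, cost_fn, cost_fp),
    )


# Hypothetical toy sample: 8 firms, 2 of them defaults (label 1).
scores = [0.05, 0.10, 0.15, 0.20, 0.30, 0.35, 0.60, 0.80]
labels = [0, 0, 0, 0, 1, 0, 0, 1]

# Equal costs reduce to maximising accuracy; the cut-off lands high
# and one of the two defaults is missed.
accuracy_cutoff = best_cutoff(scores, labels, cost_fn=1, cost_fp=1)  # 0.80

# An assumed 10:1 cost ratio pulls the cut-off down so both defaults are caught.
cost_cutoff = best_cutoff(scores, labels, cost_fn=10, cost_fp=1)  # 0.30
```

On this toy sample the accuracy-based cut-off detects only one default, while the cost-weighted cut-off detects both at the price of two false alarms. This is the kind of trade-off the abstract describes, although the paper's own criterion may differ from this sketch.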
Original language | English |
---|---|
Pages (from-to) | 1678-1693 |
Journal | Journal of Applied Statistics |
Volume | 41 |
Issue number | 8 |
Publication status | Published - 10 Feb 2014 |