Аннотация:Credit scoring is a fundamental and one of the most complex tasks that financial institutions have to deal with. Commonly this problem is considered as default probability prediction for the lenders counterparts. Knowing predictors that significantly contribute to default prediction has recently emerged as a crucial issue of credit risk analysis. Default prediction and default predictor selection are two related issues, but many existing approaches address them separately. In this paper a unified procedure is proposed that is based on the regularization approach with logistic regression as an underlying model, which simultaneously selects the default predictors and optimizes all the parameters within the model. We give the strong probabilistic statement of shrinkage criterion for features selection.
The proposed regularization do not corrupt the relevant predictors, can select correlated predictors into model, gives stable subset of relevant features. Experimental results show that the proposed framework is competitive on both artificial data and publicly available data sets.