A novel noise-adapted two-layer ensemble model for credit scoring based on backflow learning

IEEE Access(2019)

引用 13|浏览2
暂无评分
摘要
Recently, the machine learning method and artificial intelligence algorithm have become increasingly important in classification problems, such as credit scoring. Building an ensemble learning model that has been proven to be typically more accurate and robust than individual classifiers, it is an important information management task of commercial banks and loan lenders. In this paper, a novel noise-adapted two-layer ensemble model for credit scoring based on backflow learning is proposed, in which five widely used base classifiers, i.e., extreme gradient boosting, gradient boosting decision tree, support vector machine, random forest, and linear discriminant analysis, are integrated. To amplify the strength and diversity of the base classifiers, a new backflow learning approach is proposed so that the base classifiers will relearn the misclassified data point. A final predictive result is obtained by fusing the prediction of all base classifiers through two-layer ensemble modeling. In addition, considering that noise data are a major problem that aggravates the accuracy of a predictive model, a new noise adaption approach based on the isolation forest algorithm is proposed to address noise data. It first calculates the outlier score of each data point to detect the noise data that are subsequently boosted in the training set to form the noise-adapted training set. Three credit datasets from the UCI machine learning repository are tested to compare the performance of the proposed model with those of other benchmark models. The experimental results prove that our proposed model outperforms other models by demonstrating satisfactory improvement in various performance measures.
更多
查看译文
关键词
Credit scoring, ensemble model, backflow learning, noise adaption, feature engineering, machine learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要