运筹与管理 ›› 2023, Vol. 32 ›› Issue (8): 137-144.DOI: 10.12005/orms.2023.0262

• 应用研究 • 上一篇    下一篇

基于改进BS-Stacking模型的个人信用风险评估方法研究

顾清华1,2,3, 宋思远1,2, 张新生3, 暴子旗1,2   

  1. 1.西安建筑科技大学 资源工程学院,陕西 西安 710055;
    2.西安市智慧工业感知计算与决策重点实验室,陕西 西安 710055;
    3.西安建筑科技大学 管理学院,陕西 西安 710055
  • 收稿日期:2021-06-30 出版日期:2023-08-25 发布日期:2023-09-22
  • 通讯作者: 顾清华(1981-),男,教授,博士生导师,研究方向:多目标优化,车辆调度和复杂系统建模与仿真。
  • 基金资助:
    国家社会科学基金项目(18XGL010);陕西省教育厅2020年度重点科学研究计划(20JT033)

Personal Credit Risk Assessment Based on Improved BS-Stacking

GU Qinghua1,2,3, SONG Siyuan1,2, ZHANG Xinsheng3, BAO Ziqi1,2   

  1. 1. School of Resources Engineering, Xi’an University of Architecture and Technology, Xi’an 710055, China;
    2. Xi’an Key Laboratory of Perceptual Computing and Decision Making for Intelligent Industry, Xi’an 710055, China;
    3. School of Management, Xi’an University of Architecture and Technology, Xi’an 710055, China
  • Received:2021-06-30 Online:2023-08-25 Published:2023-09-22

摘要: 在个人信用违约风险与日俱增的背景下,为了使企业准确识别个人信用风险,本文提出了基于改进BS-Stacking模型的个人信用风险评估方法。针对个人信用风险数据的特点,首先对数据使用改进后的Borderline SMOTE-2算法进行过采样处理,然后使用网格搜索算法对分类器进行参数寻优,为了寻找模型的最优组合,使用逻辑回归对基模型进行贡献度分析,从而确定Stacking模型。实验表明所提出模型与各类集成算法相比,在个人信用风险评估违约样本的识别率上以及稳定性等各类指标上均有最好表现,验证了模型的有效性。

关键词: 信用风险评估, 分类, Borderline SMOTE-2, 堆叠模型

Abstract: Credit risk is the core content of today’s risk management, and a lot of research has been done on practical models that support debtor credit assessment, pricing of credit risk instruments, measurement and control of credit risk exposure, and portfolio credit loss analysis. Personal credit risk refers to the possibility of default due to failure to repay debts or loans in time and in full for various reasons, and its degree directly affects the strength of credit. In the context of increasing personal credit default risk, in order to enable enterprises to accurately identify personal credit risks, this paper presents a personal credit risk assessment method based on the improved BS-Stacking model.
This paper obtains bank data related to individual credit risk from the German credit data set published by UCI. The data set has 1000 samples, 700 positive samples, 300 negative samples, and 24 indicator attributes for each original sample. According to the characteristics of the personal credit risk data, we first use the improved Borderline SMOTE 2 algorithm to oversampling the data, and further remove the noise points on the basis of strengthening the identification of the minority sample boundary area, so as to ensure the accurate prediction of the default sample. In addition, for the problems that the classifier in the Stacking algorithm has redundancy and may reduce the prediction performance, grid search is used for parameter adjustment and LR is proposed to analyze the contribution degree of individual learners to obtain the optimal combination of individual learners and achieve the optimal performance of the entire model.
In this study, Accuracy and AUC are used to measure the accuracy of the prediction, and Precision, Recall, F1 score, and Specificity are used to measure the validity of the model. In the initial algorithm, a total of8 models SVC, GBDT, RF, AdaBoost, XGBoost, LightGBM, KNN and LR are selected as individual learners, and LR is selected as the output of individual learners to train. After unbalanced algorithm processing and comparative experiments, the model after preliminary screening is formed. After analyzing the contribution degree of the base model and testing the whole model, the optimal combination model is obtained, and the performance of the integrated model reaches the optimal state. The experiment proves the effectiveness of the unbalanced algorithm and the integrated algorithm from many angles, and also shows that the algorithm can achieve high accuracy and robustness in personal credit risk assessment.

Key words: credit risk assessment, classification, Borderline SMOTE-2, Stacking model

中图分类号: