A credit risk model with small sample data based on G-XGBoost

Article


Li, J., Liu, H., Yang, Z. and Han, L. 2021. A credit risk model with small sample data based on G-XGBoost. Applied Artificial Intelligence. 35 (15), pp. 1550-1566. https://doi.org/10.1080/08839514.2021.1987707
TypeArticle
TitleA credit risk model with small sample data based on G-XGBoost
AuthorsLi, J., Liu, H., Yang, Z. and Han, L.
Abstract

Currently existing credit risk models, e.g., Scoring Card and Extreme Gradient Boosting (XGBoost), usually have requirements for the capacity of modeling samples. The small sample size may result in the adverse outcomes for the trained models which may neither achieve the expected accuracy nor distinguish risks well. On the other hand, data acquisition can be difficult and restricted due to data protection regulations. In view of the above dilemma, this paper applies Generative Adversarial Nets (GAN) to the construction of small and micro enterprises (SMEs) credit risk model, and proposes a novel training method, namely G-XGBoost, based on the XGBoost model. A few batches of real data are selected to train GAN. When the generative network reaches Nash equilibrium, the network is used to generate pseudo data with the same distribution. The pseudo data is then combined with real data to form an amplified sample set. The amplified sample set is used to train XGBoost for credit risk prediction. The feasibility and advantages of the G-XGBoost model are demonstrated by comparing with the XGBoost model.

PublisherTaylor and Francis
JournalApplied Artificial Intelligence
ISSN0883-9514
Electronic1087-6545
Publication dates
Online28 Oct 2021
Print15 Dec 2021
Publication process dates
Deposited01 Feb 2022
Submitted07 Aug 2020
Accepted27 Sep 2021
Output statusPublished
Publisher's version
License
File Access Level
Open
Copyright Statement

© 2021 The Author(s). Published with license by Taylor & Francis Group, LLC.
This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.

Digital Object Identifier (DOI)https://doi.org/10.1080/08839514.2021.1987707
LanguageEnglish
Permalink -

https://repository.mdx.ac.uk/item/89q78

Download files

  • 33
    total views
  • 28
    total downloads
  • 3
    views this month
  • 2
    downloads this month

Export as