A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification

With the rapid development of financial economy in our country, credit business has become the main business of banks and financial companies. The quality of customer credit directly affects the business performance of financial companies. The traditional risk control model cannot identify good cust...

Full description

Saved in:
Bibliographic Details
Main Authors: Suihai, Chen, Chih How, Bong, Po Chan, Chiu
Format: Thesis
Language:English
Published: International Information and Engineering Technology Association 2024
Subjects:
Online Access:http://ir.unimas.my/id/eprint/46411/3/THESIS%20PHD_Chen%20Suihai.pdf
http://ir.unimas.my/id/eprint/46411/
https://iieta.org/journals/ijsse/paper/10.18280/ijsse.140429
https://doi.org/10.18280/ijsse.140429
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sarawak
Language: English
id my.unimas.ir-46411
record_format eprints
spelling my.unimas.ir-464112024-10-22T01:21:19Z http://ir.unimas.my/id/eprint/46411/ A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification Suihai, Chen Chih How, Bong Po Chan, Chiu QA75 Electronic computers. Computer science With the rapid development of financial economy in our country, credit business has become the main business of banks and financial companies. The quality of customer credit directly affects the business performance of financial companies. The traditional risk control model cannot identify good customers and bad customers well, so a new high precision risk control algorithm model is needed to solve this problem. CatBoost algorithm has high accuracy and takes into account the efficiency of the model, which can effectively solve such problems. However, compared with the traditional risk control algorithm (logistic regression algorithm), CatBoost algorithm also needs to have the advantages of high efficiency, low algorithm complexity and strong interpretable ability. Therefore, this study first reduces the dimension of the variable features (deleting the variable features with low correlation), and then uses two different datasets for experimental comparison and verification, which proves that the dimension reduction can improve the efficiency of the algorithm. Secondly, an improved CatBoost algorithm (EBGWO-CatBoost) was proposed, which was a combination of improved GWO algorithm (EBGWO) and CatBoost algorithm, and the optimized GWO algorithm was used to offset the defects of CatBoost algorithm in parameter tuning. The combined EBGWO-Catboost algorithm model is structured into four distinct steps. Initially, the Grey Wolf algorithm (GWO) is leveraged to exploit its advantages in identifying optimal parameters. Subsequently, the optimal parameters of the CatBoost algorithm are determined to streamline computational resource usage and reduce model complexity. However, optimizing the balance between exploration and exploitation within the GWO algorithm itself is imperative. Hence, the second step entails enhancing the GWO algorithm, referred to as the EBGWO algorithm. This enhanced version of the Grey Wolf Optimization algorithm possesses robust global search capabilities and helps alleviate some of the local convergence issues inherent in the original GWO algorithm. It can effectively enhance the predictive accuracy and execution speed of the CatBoost algorithm model. The third step involves applying the new algorithm to the risk control model for testing and comparison, resulting in the conclusion that the model established by the EBGWO-Catboost algorithm exhibits more advantages compared to models built by other algorithms. Thirdly, this study uses SHAP framework to improve the interpretability of the new algorithm (EBGWO-CatBoost), and solves the problem of the weak interpretability of the new algorithm. Finally, a summary of the entire document is provided, along with suggestions for future work and extensions. International Information and Engineering Technology Association 2024 Thesis PeerReviewed text en http://ir.unimas.my/id/eprint/46411/3/THESIS%20PHD_Chen%20Suihai.pdf Suihai, Chen and Chih How, Bong and Po Chan, Chiu (2024) A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification. PhD thesis, Universiti Malaysia Sarawak. https://iieta.org/journals/ijsse/paper/10.18280/ijsse.140429 https://doi.org/10.18280/ijsse.140429
institution Universiti Malaysia Sarawak
building Centre for Academic Information Services (CAIS)
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sarawak
content_source UNIMAS Institutional Repository
url_provider http://ir.unimas.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Suihai, Chen
Chih How, Bong
Po Chan, Chiu
A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
description With the rapid development of financial economy in our country, credit business has become the main business of banks and financial companies. The quality of customer credit directly affects the business performance of financial companies. The traditional risk control model cannot identify good customers and bad customers well, so a new high precision risk control algorithm model is needed to solve this problem. CatBoost algorithm has high accuracy and takes into account the efficiency of the model, which can effectively solve such problems. However, compared with the traditional risk control algorithm (logistic regression algorithm), CatBoost algorithm also needs to have the advantages of high efficiency, low algorithm complexity and strong interpretable ability. Therefore, this study first reduces the dimension of the variable features (deleting the variable features with low correlation), and then uses two different datasets for experimental comparison and verification, which proves that the dimension reduction can improve the efficiency of the algorithm. Secondly, an improved CatBoost algorithm (EBGWO-CatBoost) was proposed, which was a combination of improved GWO algorithm (EBGWO) and CatBoost algorithm, and the optimized GWO algorithm was used to offset the defects of CatBoost algorithm in parameter tuning. The combined EBGWO-Catboost algorithm model is structured into four distinct steps. Initially, the Grey Wolf algorithm (GWO) is leveraged to exploit its advantages in identifying optimal parameters. Subsequently, the optimal parameters of the CatBoost algorithm are determined to streamline computational resource usage and reduce model complexity. However, optimizing the balance between exploration and exploitation within the GWO algorithm itself is imperative. Hence, the second step entails enhancing the GWO algorithm, referred to as the EBGWO algorithm. This enhanced version of the Grey Wolf Optimization algorithm possesses robust global search capabilities and helps alleviate some of the local convergence issues inherent in the original GWO algorithm. It can effectively enhance the predictive accuracy and execution speed of the CatBoost algorithm model. The third step involves applying the new algorithm to the risk control model for testing and comparison, resulting in the conclusion that the model established by the EBGWO-Catboost algorithm exhibits more advantages compared to models built by other algorithms. Thirdly, this study uses SHAP framework to improve the interpretability of the new algorithm (EBGWO-CatBoost), and solves the problem of the weak interpretability of the new algorithm. Finally, a summary of the entire document is provided, along with suggestions for future work and extensions.
format Thesis
author Suihai, Chen
Chih How, Bong
Po Chan, Chiu
author_facet Suihai, Chen
Chih How, Bong
Po Chan, Chiu
author_sort Suihai, Chen
title A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
title_short A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
title_full A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
title_fullStr A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
title_full_unstemmed A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
title_sort hybrid adaptive leadership gwo optimization with category gradient boosting on decision trees algorithm for credit risk control classification
publisher International Information and Engineering Technology Association
publishDate 2024
url http://ir.unimas.my/id/eprint/46411/3/THESIS%20PHD_Chen%20Suihai.pdf
http://ir.unimas.my/id/eprint/46411/
https://iieta.org/journals/ijsse/paper/10.18280/ijsse.140429
https://doi.org/10.18280/ijsse.140429
_version_ 1814942167846617088