A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification
With the rapid development of financial economy in our country, credit business has become the main business of banks and financial companies. The quality of customer credit directly affects the business performance of financial companies. The traditional risk control model cannot identify good cust...
Saved in:
Main Authors: | , , |
---|---|
Format: | Thesis |
Language: | English |
Published: |
International Information and Engineering Technology Association
2024
|
Subjects: | |
Online Access: | http://ir.unimas.my/id/eprint/46411/3/THESIS%20PHD_Chen%20Suihai.pdf http://ir.unimas.my/id/eprint/46411/ https://iieta.org/journals/ijsse/paper/10.18280/ijsse.140429 https://doi.org/10.18280/ijsse.140429 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaysia Sarawak |
Language: | English |
id |
my.unimas.ir-46411 |
---|---|
record_format |
eprints |
spelling |
my.unimas.ir-464112024-10-22T01:21:19Z http://ir.unimas.my/id/eprint/46411/ A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification Suihai, Chen Chih How, Bong Po Chan, Chiu QA75 Electronic computers. Computer science With the rapid development of financial economy in our country, credit business has become the main business of banks and financial companies. The quality of customer credit directly affects the business performance of financial companies. The traditional risk control model cannot identify good customers and bad customers well, so a new high precision risk control algorithm model is needed to solve this problem. CatBoost algorithm has high accuracy and takes into account the efficiency of the model, which can effectively solve such problems. However, compared with the traditional risk control algorithm (logistic regression algorithm), CatBoost algorithm also needs to have the advantages of high efficiency, low algorithm complexity and strong interpretable ability. Therefore, this study first reduces the dimension of the variable features (deleting the variable features with low correlation), and then uses two different datasets for experimental comparison and verification, which proves that the dimension reduction can improve the efficiency of the algorithm. Secondly, an improved CatBoost algorithm (EBGWO-CatBoost) was proposed, which was a combination of improved GWO algorithm (EBGWO) and CatBoost algorithm, and the optimized GWO algorithm was used to offset the defects of CatBoost algorithm in parameter tuning. The combined EBGWO-Catboost algorithm model is structured into four distinct steps. Initially, the Grey Wolf algorithm (GWO) is leveraged to exploit its advantages in identifying optimal parameters. Subsequently, the optimal parameters of the CatBoost algorithm are determined to streamline computational resource usage and reduce model complexity. However, optimizing the balance between exploration and exploitation within the GWO algorithm itself is imperative. Hence, the second step entails enhancing the GWO algorithm, referred to as the EBGWO algorithm. This enhanced version of the Grey Wolf Optimization algorithm possesses robust global search capabilities and helps alleviate some of the local convergence issues inherent in the original GWO algorithm. It can effectively enhance the predictive accuracy and execution speed of the CatBoost algorithm model. The third step involves applying the new algorithm to the risk control model for testing and comparison, resulting in the conclusion that the model established by the EBGWO-Catboost algorithm exhibits more advantages compared to models built by other algorithms. Thirdly, this study uses SHAP framework to improve the interpretability of the new algorithm (EBGWO-CatBoost), and solves the problem of the weak interpretability of the new algorithm. Finally, a summary of the entire document is provided, along with suggestions for future work and extensions. International Information and Engineering Technology Association 2024 Thesis PeerReviewed text en http://ir.unimas.my/id/eprint/46411/3/THESIS%20PHD_Chen%20Suihai.pdf Suihai, Chen and Chih How, Bong and Po Chan, Chiu (2024) A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification. PhD thesis, Universiti Malaysia Sarawak. https://iieta.org/journals/ijsse/paper/10.18280/ijsse.140429 https://doi.org/10.18280/ijsse.140429 |
institution |
Universiti Malaysia Sarawak |
building |
Centre for Academic Information Services (CAIS) |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sarawak |
content_source |
UNIMAS Institutional Repository |
url_provider |
http://ir.unimas.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Suihai, Chen Chih How, Bong Po Chan, Chiu A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification |
description |
With the rapid development of financial economy in our country, credit business has become the main business of banks and financial companies. The quality of customer credit directly affects the business performance of financial companies. The traditional risk control model cannot identify good customers and bad customers well, so a new high precision risk control algorithm model is needed to solve this problem. CatBoost algorithm has high accuracy and takes into account the efficiency of the model, which can effectively solve such problems. However, compared with the traditional risk control algorithm (logistic regression algorithm), CatBoost algorithm also needs to have the advantages of high efficiency, low algorithm complexity and strong interpretable ability. Therefore, this study first reduces the dimension of the variable features (deleting the variable features with low correlation), and then uses two different datasets for experimental comparison and verification, which proves that the dimension reduction can improve the efficiency of the algorithm. Secondly, an improved CatBoost algorithm (EBGWO-CatBoost) was proposed, which was a combination of improved GWO algorithm (EBGWO) and CatBoost algorithm, and the optimized GWO algorithm was used to offset the defects of CatBoost algorithm in parameter tuning. The combined EBGWO-Catboost algorithm model is structured into four distinct steps. Initially, the Grey Wolf algorithm (GWO) is leveraged to exploit its advantages in identifying optimal parameters. Subsequently, the optimal parameters of the CatBoost algorithm are determined to streamline computational resource usage and reduce model complexity. However, optimizing the balance between exploration and exploitation within the GWO algorithm itself is imperative. Hence, the second step entails enhancing the GWO algorithm, referred to as the EBGWO algorithm. This enhanced version of the Grey Wolf Optimization algorithm possesses robust global search capabilities and helps alleviate some of the local convergence issues inherent in the original GWO algorithm. It can effectively enhance the predictive accuracy and execution speed of the CatBoost algorithm model. The third step involves applying the new algorithm to the risk control model for testing and comparison, resulting in the conclusion that the model established by the EBGWO-Catboost algorithm exhibits more advantages compared to models built by other algorithms. Thirdly, this study uses SHAP framework to improve the interpretability of the new algorithm (EBGWO-CatBoost), and solves the problem of the weak interpretability of the new algorithm. Finally, a summary of the entire document is provided, along with suggestions for future work and extensions. |
format |
Thesis |
author |
Suihai, Chen Chih How, Bong Po Chan, Chiu |
author_facet |
Suihai, Chen Chih How, Bong Po Chan, Chiu |
author_sort |
Suihai, Chen |
title |
A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification |
title_short |
A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification |
title_full |
A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification |
title_fullStr |
A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification |
title_full_unstemmed |
A Hybrid Adaptive Leadership GWO Optimization with Category Gradient Boosting on Decision Trees Algorithm for Credit Risk Control Classification |
title_sort |
hybrid adaptive leadership gwo optimization with category gradient boosting on decision trees algorithm for credit risk control classification |
publisher |
International Information and Engineering Technology Association |
publishDate |
2024 |
url |
http://ir.unimas.my/id/eprint/46411/3/THESIS%20PHD_Chen%20Suihai.pdf http://ir.unimas.my/id/eprint/46411/ https://iieta.org/journals/ijsse/paper/10.18280/ijsse.140429 https://doi.org/10.18280/ijsse.140429 |
_version_ |
1814942167846617088 |