Data analytics and predictive modelling of credit risk of bank customers
To reduce losses and increase profits, financial organizations must evaluate credit risk. In this article, we propose an ensemble model for assessing credit risk that combines the methods of neural networks and different machine-learning models. The dataset used for this research is obtained from th...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/167777 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-167777 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1677772023-07-07T15:42:53Z Data analytics and predictive modelling of credit risk of bank customers Shao, Ziyang Wong Kin Shun, Terence School of Electrical and Electronic Engineering EKSWONG@ntu.edu.sg Engineering::Electrical and electronic engineering To reduce losses and increase profits, financial organizations must evaluate credit risk. In this article, we propose an ensemble model for assessing credit risk that combines the methods of neural networks and different machine-learning models. The dataset used for this research is obtained from the IEEE DataPort and consists of various credit risk-related features. A comprehensive pipeline is created that includes feature decomposition with an auto encoder, data balancing with adaptive synthetic sampling (ADASYN) and random under sampling, feature removal using the interquartile range (IQR) method, and elimination of highly correlated features through Pearson correlation. Comparative analysis is done between the proposed ensemble model and standalone traditional models like Logistic Regression, SVC etc. The results show that the proposed ensemble model outperforms individual models in predicting credit risk in both balanced and original datasets. The ensemble model performs exceptionally well on both balanced and original dataset in accurately classifying instances, distinguishing between positive and negative cases, and selecting relevant samples, with a harmonized average of precision and recall. On the balanced and original dataset, the proposed ensemble model achieves an accuracy, precision, re-call and F1 score of 0.9995 and 0.997 respectively. In essence, the proposed ensemble model provides a very effective and dependable solution for credit risk evaluation, greatly outperforming individual models in prediction performance. Financial organizations can use this approach to better manage risks associated with credit lending. Bachelor of Engineering (Electrical and Electronic Engineering) 2023-06-04T23:57:02Z 2023-06-04T23:57:02Z 2023 Final Year Project (FYP) Shao, Z. (2023). Data analytics and predictive modelling of credit risk of bank customers. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167777 https://hdl.handle.net/10356/167777 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Shao, Ziyang Data analytics and predictive modelling of credit risk of bank customers |
description |
To reduce losses and increase profits, financial organizations must evaluate credit risk. In this article, we propose an ensemble model for assessing credit risk that combines the methods of neural networks and different machine-learning models. The dataset used for this research is obtained from the IEEE DataPort and consists of various credit risk-related features. A comprehensive pipeline is created that includes feature decomposition with an auto encoder, data balancing with adaptive synthetic sampling (ADASYN) and random under sampling, feature removal using the interquartile range (IQR) method, and elimination of highly correlated features through Pearson correlation. Comparative analysis is done between the proposed ensemble model and standalone traditional models like Logistic Regression, SVC etc. The results show that the proposed ensemble model outperforms individual models in predicting credit risk in both balanced and original datasets. The ensemble model performs exceptionally well on both balanced and original dataset in accurately classifying instances, distinguishing between positive and negative cases, and selecting relevant samples, with a harmonized average of precision and recall. On the balanced and original dataset, the proposed ensemble model achieves an accuracy, precision, re-call and F1 score of 0.9995 and 0.997 respectively. In essence, the proposed ensemble model provides a very effective and dependable solution for credit risk evaluation, greatly outperforming individual models in prediction performance. Financial organizations can use this approach to better manage risks associated with credit lending. |
author2 |
Wong Kin Shun, Terence |
author_facet |
Wong Kin Shun, Terence Shao, Ziyang |
format |
Final Year Project |
author |
Shao, Ziyang |
author_sort |
Shao, Ziyang |
title |
Data analytics and predictive modelling of credit risk of bank customers |
title_short |
Data analytics and predictive modelling of credit risk of bank customers |
title_full |
Data analytics and predictive modelling of credit risk of bank customers |
title_fullStr |
Data analytics and predictive modelling of credit risk of bank customers |
title_full_unstemmed |
Data analytics and predictive modelling of credit risk of bank customers |
title_sort |
data analytics and predictive modelling of credit risk of bank customers |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/167777 |
_version_ |
1772827537566597120 |