Data analytics and predictive modelling of credit risk of bank customers

To reduce losses and increase profits, financial organizations must evaluate credit risk. In this article, we propose an ensemble model for assessing credit risk that combines the methods of neural networks and different machine-learning models. The dataset used for this research is obtained from th...

Full description

Saved in:
Bibliographic Details
Main Author: Shao, Ziyang
Other Authors: Wong Kin Shun, Terence
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/167777
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-167777
record_format dspace
spelling sg-ntu-dr.10356-1677772023-07-07T15:42:53Z Data analytics and predictive modelling of credit risk of bank customers Shao, Ziyang Wong Kin Shun, Terence School of Electrical and Electronic Engineering EKSWONG@ntu.edu.sg Engineering::Electrical and electronic engineering To reduce losses and increase profits, financial organizations must evaluate credit risk. In this article, we propose an ensemble model for assessing credit risk that combines the methods of neural networks and different machine-learning models. The dataset used for this research is obtained from the IEEE DataPort and consists of various credit risk-related features. A comprehensive pipeline is created that includes feature decomposition with an auto encoder, data balancing with adaptive synthetic sampling (ADASYN) and random under sampling, feature removal using the interquartile range (IQR) method, and elimination of highly correlated features through Pearson correlation. Comparative analysis is done between the proposed ensemble model and standalone traditional models like Logistic Regression, SVC etc. The results show that the proposed ensemble model outperforms individual models in predicting credit risk in both balanced and original datasets. The ensemble model performs exceptionally well on both balanced and original dataset in accurately classifying instances, distinguishing between positive and negative cases, and selecting relevant samples, with a harmonized average of precision and recall. On the balanced and original dataset, the proposed ensemble model achieves an accuracy, precision, re-call and F1 score of 0.9995 and 0.997 respectively. In essence, the proposed ensemble model provides a very effective and dependable solution for credit risk evaluation, greatly outperforming individual models in prediction performance. Financial organizations can use this approach to better manage risks associated with credit lending. Bachelor of Engineering (Electrical and Electronic Engineering) 2023-06-04T23:57:02Z 2023-06-04T23:57:02Z 2023 Final Year Project (FYP) Shao, Z. (2023). Data analytics and predictive modelling of credit risk of bank customers. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167777 https://hdl.handle.net/10356/167777 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Shao, Ziyang
Data analytics and predictive modelling of credit risk of bank customers
description To reduce losses and increase profits, financial organizations must evaluate credit risk. In this article, we propose an ensemble model for assessing credit risk that combines the methods of neural networks and different machine-learning models. The dataset used for this research is obtained from the IEEE DataPort and consists of various credit risk-related features. A comprehensive pipeline is created that includes feature decomposition with an auto encoder, data balancing with adaptive synthetic sampling (ADASYN) and random under sampling, feature removal using the interquartile range (IQR) method, and elimination of highly correlated features through Pearson correlation. Comparative analysis is done between the proposed ensemble model and standalone traditional models like Logistic Regression, SVC etc. The results show that the proposed ensemble model outperforms individual models in predicting credit risk in both balanced and original datasets. The ensemble model performs exceptionally well on both balanced and original dataset in accurately classifying instances, distinguishing between positive and negative cases, and selecting relevant samples, with a harmonized average of precision and recall. On the balanced and original dataset, the proposed ensemble model achieves an accuracy, precision, re-call and F1 score of 0.9995 and 0.997 respectively. In essence, the proposed ensemble model provides a very effective and dependable solution for credit risk evaluation, greatly outperforming individual models in prediction performance. Financial organizations can use this approach to better manage risks associated with credit lending.
author2 Wong Kin Shun, Terence
author_facet Wong Kin Shun, Terence
Shao, Ziyang
format Final Year Project
author Shao, Ziyang
author_sort Shao, Ziyang
title Data analytics and predictive modelling of credit risk of bank customers
title_short Data analytics and predictive modelling of credit risk of bank customers
title_full Data analytics and predictive modelling of credit risk of bank customers
title_fullStr Data analytics and predictive modelling of credit risk of bank customers
title_full_unstemmed Data analytics and predictive modelling of credit risk of bank customers
title_sort data analytics and predictive modelling of credit risk of bank customers
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/167777
_version_ 1772827537566597120