Determining the credit worthiness of retail banking customer by machine learning technique
This research project conducts a comparative analysis of statistical and machine learning models for credit risk assessment, focusing on their performance in the context of imbalanced datasets common in loan default prediction. Traditional statistical models and advanced machine learning algorith...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/176996 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-176996 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1769962024-05-24T15:45:47Z Determining the credit worthiness of retail banking customer by machine learning technique Peng, Yangling Wong Kin Shun, Terence School of Electrical and Electronic Engineering EKSWONG@ntu.edu.sg Engineering Machine learning This research project conducts a comparative analysis of statistical and machine learning models for credit risk assessment, focusing on their performance in the context of imbalanced datasets common in loan default prediction. Traditional statistical models and advanced machine learning algorithms, including Decision Trees, Support Vector Machines (SVMs), and K-Nearest Neighbors (KNN), were evaluated to determine the most effective approach for predicting creditworthiness. Machine learning models outperformed their statistical counterparts, adeptly identifying complex, non-linear patterns and adjusting to data imbalance. Resampling techniques, especially AdaptiveSMOTE, enhanced the predictive accuracy of distance-based models such as KNN and SVM by ensuring a balanced class distribution, crucial for minority class prediction. Statistical models faced challenges with the skewed original dataset, which were mitigated by implementing SMOTE and AdaptiveSMOTE to balance class representation. These resampling strategies proved critical in improving the models' prediction reliability. The study addresses the acute challenge in banking: the scarcity of default data. The improved analytical methods bolstered the banks' capacity for precise credit risk management, thereby reinforcing the financial sector's defenses against potential defaults. Bachelor's degree 2024-05-24T06:15:32Z 2024-05-24T06:15:32Z 2024 Final Year Project (FYP) Peng, Y. (2024). Determining the credit worthiness of retail banking customer by machine learning technique. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176996 https://hdl.handle.net/10356/176996 en A2250-231 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering Machine learning |
spellingShingle |
Engineering Machine learning Peng, Yangling Determining the credit worthiness of retail banking customer by machine learning technique |
description |
This research project conducts a comparative analysis of statistical and machine learning
models for credit risk assessment, focusing on their performance in the context of imbalanced
datasets common in loan default prediction. Traditional statistical models and advanced
machine learning algorithms, including Decision Trees, Support Vector Machines (SVMs), and
K-Nearest Neighbors (KNN), were evaluated to determine the most effective approach for
predicting creditworthiness. Machine learning models outperformed their statistical
counterparts, adeptly identifying complex, non-linear patterns and adjusting to data imbalance.
Resampling techniques, especially AdaptiveSMOTE, enhanced the predictive accuracy of
distance-based models such as KNN and SVM by ensuring a balanced class distribution,
crucial for minority class prediction. Statistical models faced challenges with the skewed
original dataset, which were mitigated by implementing SMOTE and AdaptiveSMOTE to
balance class representation. These resampling strategies proved critical in improving the
models' prediction reliability. The study addresses the acute challenge in banking: the scarcity
of default data. The improved analytical methods bolstered the banks' capacity for precise
credit risk management, thereby reinforcing the financial sector's defenses against potential
defaults. |
author2 |
Wong Kin Shun, Terence |
author_facet |
Wong Kin Shun, Terence Peng, Yangling |
format |
Final Year Project |
author |
Peng, Yangling |
author_sort |
Peng, Yangling |
title |
Determining the credit worthiness of retail banking customer by machine learning technique |
title_short |
Determining the credit worthiness of retail banking customer by machine learning technique |
title_full |
Determining the credit worthiness of retail banking customer by machine learning technique |
title_fullStr |
Determining the credit worthiness of retail banking customer by machine learning technique |
title_full_unstemmed |
Determining the credit worthiness of retail banking customer by machine learning technique |
title_sort |
determining the credit worthiness of retail banking customer by machine learning technique |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/176996 |
_version_ |
1806059747457105920 |