Determining the credit worthiness of retail banking customer by machine learning technique

This research project conducts a comparative analysis of statistical and machine learning models for credit risk assessment, focusing on their performance in the context of imbalanced datasets common in loan default prediction. Traditional statistical models and advanced machine learning algorith...

Full description

Saved in:
Bibliographic Details
Main Author: Peng, Yangling
Other Authors: Wong Kin Shun, Terence
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/176996
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-176996
record_format dspace
spelling sg-ntu-dr.10356-1769962024-05-24T15:45:47Z Determining the credit worthiness of retail banking customer by machine learning technique Peng, Yangling Wong Kin Shun, Terence School of Electrical and Electronic Engineering EKSWONG@ntu.edu.sg Engineering Machine learning This research project conducts a comparative analysis of statistical and machine learning models for credit risk assessment, focusing on their performance in the context of imbalanced datasets common in loan default prediction. Traditional statistical models and advanced machine learning algorithms, including Decision Trees, Support Vector Machines (SVMs), and K-Nearest Neighbors (KNN), were evaluated to determine the most effective approach for predicting creditworthiness. Machine learning models outperformed their statistical counterparts, adeptly identifying complex, non-linear patterns and adjusting to data imbalance. Resampling techniques, especially AdaptiveSMOTE, enhanced the predictive accuracy of distance-based models such as KNN and SVM by ensuring a balanced class distribution, crucial for minority class prediction. Statistical models faced challenges with the skewed original dataset, which were mitigated by implementing SMOTE and AdaptiveSMOTE to balance class representation. These resampling strategies proved critical in improving the models' prediction reliability. The study addresses the acute challenge in banking: the scarcity of default data. The improved analytical methods bolstered the banks' capacity for precise credit risk management, thereby reinforcing the financial sector's defenses against potential defaults. Bachelor's degree 2024-05-24T06:15:32Z 2024-05-24T06:15:32Z 2024 Final Year Project (FYP) Peng, Y. (2024). Determining the credit worthiness of retail banking customer by machine learning technique. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176996 https://hdl.handle.net/10356/176996 en A2250-231 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Machine learning
spellingShingle Engineering
Machine learning
Peng, Yangling
Determining the credit worthiness of retail banking customer by machine learning technique
description This research project conducts a comparative analysis of statistical and machine learning models for credit risk assessment, focusing on their performance in the context of imbalanced datasets common in loan default prediction. Traditional statistical models and advanced machine learning algorithms, including Decision Trees, Support Vector Machines (SVMs), and K-Nearest Neighbors (KNN), were evaluated to determine the most effective approach for predicting creditworthiness. Machine learning models outperformed their statistical counterparts, adeptly identifying complex, non-linear patterns and adjusting to data imbalance. Resampling techniques, especially AdaptiveSMOTE, enhanced the predictive accuracy of distance-based models such as KNN and SVM by ensuring a balanced class distribution, crucial for minority class prediction. Statistical models faced challenges with the skewed original dataset, which were mitigated by implementing SMOTE and AdaptiveSMOTE to balance class representation. These resampling strategies proved critical in improving the models' prediction reliability. The study addresses the acute challenge in banking: the scarcity of default data. The improved analytical methods bolstered the banks' capacity for precise credit risk management, thereby reinforcing the financial sector's defenses against potential defaults.
author2 Wong Kin Shun, Terence
author_facet Wong Kin Shun, Terence
Peng, Yangling
format Final Year Project
author Peng, Yangling
author_sort Peng, Yangling
title Determining the credit worthiness of retail banking customer by machine learning technique
title_short Determining the credit worthiness of retail banking customer by machine learning technique
title_full Determining the credit worthiness of retail banking customer by machine learning technique
title_fullStr Determining the credit worthiness of retail banking customer by machine learning technique
title_full_unstemmed Determining the credit worthiness of retail banking customer by machine learning technique
title_sort determining the credit worthiness of retail banking customer by machine learning technique
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/176996
_version_ 1806059747457105920