Data driven determination of retail banking customer credit worthiness
Credit risk assessment is a major and critical challenge in the finance industry, as accurate and precise predictions of customer creditworthiness can impact lending decisions and financial stability. This project will explore the application of machine learning models to predict credit risk, lever...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181758 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-181758 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1817582024-12-20T15:45:59Z Data driven determination of retail banking customer credit worthiness Wang, Christabell Weiqi Wong Kin Shun, Terence School of Electrical and Electronic Engineering EKSWONG@ntu.edu.sg Engineering Credit-risk Credit risk assessment is a major and critical challenge in the finance industry, as accurate and precise predictions of customer creditworthiness can impact lending decisions and financial stability. This project will explore the application of machine learning models to predict credit risk, leveraging imbalanced datasets and feature engineering techniques to improve performance of the models. Our research compares traditional classifiers such as Logistic Regression and Decision Trees with more advanced ensemble models like Random Forest, LightGBM, and XGBoost. To address the heavy imbalance in the dataset, Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Furthermore, SelectKBest was employed for feature selection based on statistical relevance. The results indicate that ensemble models outperform traditional classifiers, achieving higher precision, recall, and ROC-AUC scores. A final ensemble model combining the strengths of Random Forest, LightGBM, and XGBoost was used to achieve optimal performance. The study highlights the importance of balancing datasets, meaningful feature engineering, and thorough evaluation metrics for robust credit risk analysis. By integrating domain knowledge with machine learning techniques, this research demonstrates the potential for improving credit risk prediction accuracy while reducing the reliance on purely heuristic methods. The findings contribute to advancing machine learning applications in the financial domain, offering a foundation for future research on integrating advanced models and addressing ethical considerations in automated lending systems. This work emphasizes the transformative potential of machine learning in credit risk assessment, paving the way for more efficient and equitable financial decision-making processes. Bachelor's degree 2024-12-17T12:51:16Z 2024-12-17T12:51:16Z 2024 Final Year Project (FYP) Wang, C. W. (2024). Data driven determination of retail banking customer credit worthiness. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181758 https://hdl.handle.net/10356/181758 en A2380-232 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering Credit-risk |
spellingShingle |
Engineering Credit-risk Wang, Christabell Weiqi Data driven determination of retail banking customer credit worthiness |
description |
Credit risk assessment is a major and critical challenge in the finance industry, as accurate and precise predictions of customer creditworthiness can impact lending decisions and financial stability. This project will explore the application of machine learning models to predict credit risk, leveraging imbalanced datasets and feature engineering techniques to improve performance of the models. Our research compares traditional classifiers such as Logistic Regression and Decision Trees with more advanced ensemble models like Random Forest, LightGBM, and XGBoost. To address the heavy imbalance in the dataset, Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Furthermore, SelectKBest was employed for feature selection based on statistical relevance.
The results indicate that ensemble models outperform traditional classifiers, achieving higher precision, recall, and ROC-AUC scores. A final ensemble model combining the strengths of Random Forest, LightGBM, and XGBoost was used to achieve optimal performance. The study highlights the importance of balancing datasets, meaningful feature engineering, and thorough evaluation metrics for robust credit risk analysis. By integrating domain knowledge with machine learning techniques, this research demonstrates the potential for improving credit risk prediction accuracy while reducing the reliance on purely heuristic methods.
The findings contribute to advancing machine learning applications in the financial domain, offering a foundation for future research on integrating advanced models and addressing ethical considerations in automated lending systems. This work emphasizes the transformative potential of machine learning in credit risk assessment, paving the way for more efficient and equitable financial decision-making processes. |
author2 |
Wong Kin Shun, Terence |
author_facet |
Wong Kin Shun, Terence Wang, Christabell Weiqi |
format |
Final Year Project |
author |
Wang, Christabell Weiqi |
author_sort |
Wang, Christabell Weiqi |
title |
Data driven determination of retail banking customer credit worthiness |
title_short |
Data driven determination of retail banking customer credit worthiness |
title_full |
Data driven determination of retail banking customer credit worthiness |
title_fullStr |
Data driven determination of retail banking customer credit worthiness |
title_full_unstemmed |
Data driven determination of retail banking customer credit worthiness |
title_sort |
data driven determination of retail banking customer credit worthiness |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181758 |
_version_ |
1819112991626362880 |