Accident occurrence predictions with cost effective feature selection methods

Traffic accidents are a huge social cost that needs to be curbed and prediction of traffic accidents helps officials in implementing strategies for safer roads. Most of the research in this field focuses on using powerful classifiers for higher performance in making traffic accident predictions....

Full description

Saved in:
Bibliographic Details
Main Author: Garimella Krishna Apoorva
Other Authors: Zhu Feng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/141918
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-141918
record_format dspace
spelling sg-ntu-dr.10356-1419182020-06-11T11:19:16Z Accident occurrence predictions with cost effective feature selection methods Garimella Krishna Apoorva Zhu Feng School of Civil and Environmental Engineering zhufeng@ntu.edu.sg Engineering::Civil engineering Traffic accidents are a huge social cost that needs to be curbed and prediction of traffic accidents helps officials in implementing strategies for safer roads. Most of the research in this field focuses on using powerful classifiers for higher performance in making traffic accident predictions. This report works on building a high performing model by introducing cost-sensitive feature selection methods as a pre processing step that helps to curb the class imbalance and noisy features problem most accident prediction models face when only a classifier without a pre-processing step is used. This study introduces three cost-sensitive feature selection methods that factor in imbalanced data and helps in figuring out important features using various performance metrics. The selected features are then used in the Cost-sensitive Support Vector Machine Classifier to check for the performance. K-Nearest Neighbor classifier was used as well to draw comparisons on the performance of the three feature selection methods. Overall, the report has found feature selection to be an effective pre-processing step for traffic accident prediction and of all features used, weather was found to be the most important feature with the highest feature score and AUC score, followed by features such as location, time and speed. The results were comparable to previous researches in determining important features for traffic accident prediction. Bachelor of Engineering (Civil) 2020-06-11T11:19:16Z 2020-06-11T11:19:16Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/141918 en TR-09 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Civil engineering
spellingShingle Engineering::Civil engineering
Garimella Krishna Apoorva
Accident occurrence predictions with cost effective feature selection methods
description Traffic accidents are a huge social cost that needs to be curbed and prediction of traffic accidents helps officials in implementing strategies for safer roads. Most of the research in this field focuses on using powerful classifiers for higher performance in making traffic accident predictions. This report works on building a high performing model by introducing cost-sensitive feature selection methods as a pre processing step that helps to curb the class imbalance and noisy features problem most accident prediction models face when only a classifier without a pre-processing step is used. This study introduces three cost-sensitive feature selection methods that factor in imbalanced data and helps in figuring out important features using various performance metrics. The selected features are then used in the Cost-sensitive Support Vector Machine Classifier to check for the performance. K-Nearest Neighbor classifier was used as well to draw comparisons on the performance of the three feature selection methods. Overall, the report has found feature selection to be an effective pre-processing step for traffic accident prediction and of all features used, weather was found to be the most important feature with the highest feature score and AUC score, followed by features such as location, time and speed. The results were comparable to previous researches in determining important features for traffic accident prediction.
author2 Zhu Feng
author_facet Zhu Feng
Garimella Krishna Apoorva
format Final Year Project
author Garimella Krishna Apoorva
author_sort Garimella Krishna Apoorva
title Accident occurrence predictions with cost effective feature selection methods
title_short Accident occurrence predictions with cost effective feature selection methods
title_full Accident occurrence predictions with cost effective feature selection methods
title_fullStr Accident occurrence predictions with cost effective feature selection methods
title_full_unstemmed Accident occurrence predictions with cost effective feature selection methods
title_sort accident occurrence predictions with cost effective feature selection methods
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/141918
_version_ 1681057677457752064