Accident occurrence predictions with cost effective feature selection methods
Traffic accidents are a huge social cost that needs to be curbed and prediction of traffic accidents helps officials in implementing strategies for safer roads. Most of the research in this field focuses on using powerful classifiers for higher performance in making traffic accident predictions....
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/141918 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-141918 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1419182020-06-11T11:19:16Z Accident occurrence predictions with cost effective feature selection methods Garimella Krishna Apoorva Zhu Feng School of Civil and Environmental Engineering zhufeng@ntu.edu.sg Engineering::Civil engineering Traffic accidents are a huge social cost that needs to be curbed and prediction of traffic accidents helps officials in implementing strategies for safer roads. Most of the research in this field focuses on using powerful classifiers for higher performance in making traffic accident predictions. This report works on building a high performing model by introducing cost-sensitive feature selection methods as a pre processing step that helps to curb the class imbalance and noisy features problem most accident prediction models face when only a classifier without a pre-processing step is used. This study introduces three cost-sensitive feature selection methods that factor in imbalanced data and helps in figuring out important features using various performance metrics. The selected features are then used in the Cost-sensitive Support Vector Machine Classifier to check for the performance. K-Nearest Neighbor classifier was used as well to draw comparisons on the performance of the three feature selection methods. Overall, the report has found feature selection to be an effective pre-processing step for traffic accident prediction and of all features used, weather was found to be the most important feature with the highest feature score and AUC score, followed by features such as location, time and speed. The results were comparable to previous researches in determining important features for traffic accident prediction. Bachelor of Engineering (Civil) 2020-06-11T11:19:16Z 2020-06-11T11:19:16Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/141918 en TR-09 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Civil engineering |
spellingShingle |
Engineering::Civil engineering Garimella Krishna Apoorva Accident occurrence predictions with cost effective feature selection methods |
description |
Traffic accidents are a huge social cost that needs to be curbed and prediction of traffic accidents helps
officials in implementing strategies for safer roads. Most of the research in this field focuses on using
powerful classifiers for higher performance in making traffic accident predictions. This report works
on building a high performing model by introducing cost-sensitive feature selection methods as a pre
processing step that helps to curb the class imbalance and noisy features problem most accident
prediction models face when only a classifier without a pre-processing step is used. This study
introduces three cost-sensitive feature selection methods that factor in imbalanced data and helps in
figuring out important features using various performance metrics. The selected features are then
used in the Cost-sensitive Support Vector Machine Classifier to check for the performance. K-Nearest
Neighbor classifier was used as well to draw comparisons on the performance of the three feature
selection methods. Overall, the report has found feature selection to be an effective pre-processing
step for traffic accident prediction and of all features used, weather was found to be the most
important feature with the highest feature score and AUC score, followed by features such as location,
time and speed. The results were comparable to previous researches in determining important
features for traffic accident prediction. |
author2 |
Zhu Feng |
author_facet |
Zhu Feng Garimella Krishna Apoorva |
format |
Final Year Project |
author |
Garimella Krishna Apoorva |
author_sort |
Garimella Krishna Apoorva |
title |
Accident occurrence predictions with cost effective feature selection methods |
title_short |
Accident occurrence predictions with cost effective feature selection methods |
title_full |
Accident occurrence predictions with cost effective feature selection methods |
title_fullStr |
Accident occurrence predictions with cost effective feature selection methods |
title_full_unstemmed |
Accident occurrence predictions with cost effective feature selection methods |
title_sort |
accident occurrence predictions with cost effective feature selection methods |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/141918 |
_version_ |
1681057677457752064 |