Predicting fatal fall from heights accidents using random forest classification machine learning model

The focus of this paper is to use machine learning to create a prediction model that detects the probable factors impacting fatal falls from heights accidents at the Malaysia construction industry. The dataset used in this study was imported from the Department of Occupational Safety and Health of M...

Full description

Saved in:
Bibliographic Details
Main Authors: Zermane, Abderrahim, Mohd Tohir, Mohd Zahirasri, Zermane, Hanane, Baharudin, Mohd Rafee, Mohamed Yusoff, Hamdan
Format: Article
Published: Elsevier 2023
Online Access:http://psasir.upm.edu.my/id/eprint/110072/
https://www.sciencedirect.com/science/article/pii/S0925753522003629
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Putra Malaysia
id my.upm.eprints.110072
record_format eprints
spelling my.upm.eprints.1100722024-09-05T07:31:32Z http://psasir.upm.edu.my/id/eprint/110072/ Predicting fatal fall from heights accidents using random forest classification machine learning model Zermane, Abderrahim Mohd Tohir, Mohd Zahirasri Zermane, Hanane Baharudin, Mohd Rafee Mohamed Yusoff, Hamdan The focus of this paper is to use machine learning to create a prediction model that detects the probable factors impacting fatal falls from heights accidents at the Malaysia construction industry. The dataset used in this study was imported from the Department of Occupational Safety and Health of Malaysia’s industrial accident database. The dataset details 3321 accident scenarios which include; the date, the activity, the region, the summary of the accident, the direct cause, the root cause, and the factors. Seven machine learning models were tested to determine which model fits the dataset better, and as a result, the Random Forest Classification model was selected for this work. Random Forest classification tested several contributing factors such as: site conditions, management factors, individual characteristics and agent factors separately to determine their accuracy. Management factors and individual characteristics factors recorded the highest accuracy in every other prediction model; while agent factors recorded the highest accuracy in the random forest model. Additionally, this approach created ensemble predictions based on all of the dataset's characteristics. As a result, this study establishes the feasibility of machine learning in the field of construction safety management. The provided results can aid in accident prevention by increasing awareness of potential safety hazards, quantitatively predicting fatal accidents, and implementing the findings in potential safety management systems. Elsevier 2023 Article PeerReviewed Zermane, Abderrahim and Mohd Tohir, Mohd Zahirasri and Zermane, Hanane and Baharudin, Mohd Rafee and Mohamed Yusoff, Hamdan (2023) Predicting fatal fall from heights accidents using random forest classification machine learning model. Safety Science, 159. art. no. 106023. pp. 1-10. ISSN 0925-7535; ESSN: 1879-1042 https://www.sciencedirect.com/science/article/pii/S0925753522003629 10.1016/j.ssci.2022.106023
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
description The focus of this paper is to use machine learning to create a prediction model that detects the probable factors impacting fatal falls from heights accidents at the Malaysia construction industry. The dataset used in this study was imported from the Department of Occupational Safety and Health of Malaysia’s industrial accident database. The dataset details 3321 accident scenarios which include; the date, the activity, the region, the summary of the accident, the direct cause, the root cause, and the factors. Seven machine learning models were tested to determine which model fits the dataset better, and as a result, the Random Forest Classification model was selected for this work. Random Forest classification tested several contributing factors such as: site conditions, management factors, individual characteristics and agent factors separately to determine their accuracy. Management factors and individual characteristics factors recorded the highest accuracy in every other prediction model; while agent factors recorded the highest accuracy in the random forest model. Additionally, this approach created ensemble predictions based on all of the dataset's characteristics. As a result, this study establishes the feasibility of machine learning in the field of construction safety management. The provided results can aid in accident prevention by increasing awareness of potential safety hazards, quantitatively predicting fatal accidents, and implementing the findings in potential safety management systems.
format Article
author Zermane, Abderrahim
Mohd Tohir, Mohd Zahirasri
Zermane, Hanane
Baharudin, Mohd Rafee
Mohamed Yusoff, Hamdan
spellingShingle Zermane, Abderrahim
Mohd Tohir, Mohd Zahirasri
Zermane, Hanane
Baharudin, Mohd Rafee
Mohamed Yusoff, Hamdan
Predicting fatal fall from heights accidents using random forest classification machine learning model
author_facet Zermane, Abderrahim
Mohd Tohir, Mohd Zahirasri
Zermane, Hanane
Baharudin, Mohd Rafee
Mohamed Yusoff, Hamdan
author_sort Zermane, Abderrahim
title Predicting fatal fall from heights accidents using random forest classification machine learning model
title_short Predicting fatal fall from heights accidents using random forest classification machine learning model
title_full Predicting fatal fall from heights accidents using random forest classification machine learning model
title_fullStr Predicting fatal fall from heights accidents using random forest classification machine learning model
title_full_unstemmed Predicting fatal fall from heights accidents using random forest classification machine learning model
title_sort predicting fatal fall from heights accidents using random forest classification machine learning model
publisher Elsevier
publishDate 2023
url http://psasir.upm.edu.my/id/eprint/110072/
https://www.sciencedirect.com/science/article/pii/S0925753522003629
_version_ 1811686042948861952