The impact of fuzzy discretization's output on classification accuracy of random forest classifier

Bibliographic Details
Main Authors: Fikri, M.N., Hassan, M.F., Tran, D.C.
Format: Article
Published: World Academy of Research in Science and Engineering 2020
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85087461488&doi=10.30534%2fijatcse%2f2020%2f218932020&partnerID=40&md5=5f09fbd5c4968167d06c3cfa5ad3bb62
http://eprints.utp.edu.my/23170/
Institution: Universiti Teknologi Petronas
Description
Summary: Random Forest (RF) is among the most widely used classification algorithms among researchers and machine learning enthusiasts. Recently, fuzzy discretization has been paired with the RF classifier to improve its classification accuracy on continuous variables. However, opinions differ on whether discretization is needed during data pre-processing for tree-based classifiers such as J48, Decision Tree and Random Forest. In addition, different classification algorithms are known to produce different classification accuracies depending on the type of data used, in other words, on the output of the data discretization process. To test this hypothesis, this study examines the impact of different fuzzy discretization outputs on the classification accuracy of the Random Forest classifier. Three versions of the simulation were run, each with a different fuzzy discretization output: 1) without fuzzy discretization, 2) with full fuzzy discretization, and 3) with partial fuzzy discretization. Classification was then performed with the Random Forest classifier, and the classification accuracy of each simulation version was observed, recorded, and analyzed. © 2020, World Academy of Research in Science and Engineering. All rights reserved.
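The three input versions named in the summary can be illustrated with a minimal sketch. Everything in it is an assumption for illustration and is not taken from the article: the triangular membership functions, the three fuzzy sets ("low", "medium", "high"), the helper names `triangular`, `fuzzify` and `build_version`, and the reading of "partial" as "only some feature columns are fuzzified". Each resulting table would then be passed to a Random Forest classifier (not shown) and the accuracies compared.

```python
def triangular(x, a, b, c):
    """Membership degree of x in a triangular fuzzy set with feet a, c and peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def fuzzify(x, lo, hi):
    """Map a continuous value to degrees of membership in three fuzzy sets.

    Assumes hi > lo. The sets are placed so that the degrees sum to 1
    for any x inside [lo, hi].
    """
    half = (hi - lo) / 2
    mid = lo + half
    return {
        "low": triangular(x, lo - half, lo, mid),
        "medium": triangular(x, lo, mid, hi),
        "high": triangular(x, mid, hi, hi + half),
    }


def build_version(rows, ranges, fuzzy_columns):
    """Return a copy of rows where the chosen columns are replaced by membership degrees.

    fuzzy_columns = empty set      -> version 1 (no fuzzy discretization)
    fuzzy_columns = all columns    -> version 2 (full, as assumed here)
    fuzzy_columns = some columns   -> version 3 (partial, as assumed here)
    """
    out = []
    for row in rows:
        new_row = []
        for i, value in enumerate(row):
            if i in fuzzy_columns:
                lo, hi = ranges[i]
                new_row.extend(fuzzify(value, lo, hi).values())
            else:
                new_row.append(value)
        out.append(new_row)
    return out


# Hypothetical two-feature sample with assumed feature ranges.
rows = [[2.5, 25.0]]
ranges = [(0.0, 10.0), (0.0, 100.0)]

no_fuzzy = build_version(rows, ranges, set())
full_fuzzy = build_version(rows, ranges, {0, 1})
partial_fuzzy = build_version(rows, ranges, {0})
```

The sketch only prepares the inputs; it makes no claim about which version yields the higher accuracy.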