Achieving higher classification accuracy with ensemble of trees

Classification is a process where a classifier predicts a class label to an object using the set of inputs. One simple method to solve classification problems is a decision tree, a classifier which can be easily interpreted with a graph and yet produces potentially high accuracies. However, there is...

Full description

Saved in:
Bibliographic Details
Main Author: Cheng, Wen Xin
Other Authors: Ponnuthurai Nagaratnam Suganthan
Format: Final Year Project
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/71661
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-71661
record_format dspace
spelling sg-ntu-dr.10356-716612023-07-07T17:22:05Z Achieving higher classification accuracy with ensemble of trees Cheng, Wen Xin Ponnuthurai Nagaratnam Suganthan School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Classification is a process where a classifier predicts a class label to an object using the set of inputs. One simple method to solve classification problems is a decision tree, a classifier which can be easily interpreted with a graph and yet produces potentially high accuracies. However, there is a limitation of this method: decision tree is unstable classifiers. This implies that a small change in the dataset results in completely different structure of the decision tree. Therefore, there is a need for ensemble methods, which can significantly improve the performance. In this project, we study on standard decision trees and their ensembles: Bootstrapped Aggregating, Random Forest, Extremely Randomised Trees, Rotation Forest, Gradient Boosting and Adaptive Boosting, and assess the performance of selected classifiers on real-world datasets. In addition, we propose a new ensemble method called Heterogeneous Ensemble of trees and compare its performance with existing methods. From the evaluation of 6 different classifiers on 10 real-world datasets, it shows that Heterogeneous Ensemble of trees outperforms other classifiers including Random Forest, the best ranked classifiers among 179 classifiers in a recent survey with 121 real-world datasets. Bachelor of Engineering 2017-05-18T06:47:53Z 2017-05-18T06:47:53Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/71661 en Nanyang Technological University 54 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Cheng, Wen Xin
Achieving higher classification accuracy with ensemble of trees
description Classification is a process where a classifier predicts a class label to an object using the set of inputs. One simple method to solve classification problems is a decision tree, a classifier which can be easily interpreted with a graph and yet produces potentially high accuracies. However, there is a limitation of this method: decision tree is unstable classifiers. This implies that a small change in the dataset results in completely different structure of the decision tree. Therefore, there is a need for ensemble methods, which can significantly improve the performance. In this project, we study on standard decision trees and their ensembles: Bootstrapped Aggregating, Random Forest, Extremely Randomised Trees, Rotation Forest, Gradient Boosting and Adaptive Boosting, and assess the performance of selected classifiers on real-world datasets. In addition, we propose a new ensemble method called Heterogeneous Ensemble of trees and compare its performance with existing methods. From the evaluation of 6 different classifiers on 10 real-world datasets, it shows that Heterogeneous Ensemble of trees outperforms other classifiers including Random Forest, the best ranked classifiers among 179 classifiers in a recent survey with 121 real-world datasets.
author2 Ponnuthurai Nagaratnam Suganthan
author_facet Ponnuthurai Nagaratnam Suganthan
Cheng, Wen Xin
format Final Year Project
author Cheng, Wen Xin
author_sort Cheng, Wen Xin
title Achieving higher classification accuracy with ensemble of trees
title_short Achieving higher classification accuracy with ensemble of trees
title_full Achieving higher classification accuracy with ensemble of trees
title_fullStr Achieving higher classification accuracy with ensemble of trees
title_full_unstemmed Achieving higher classification accuracy with ensemble of trees
title_sort achieving higher classification accuracy with ensemble of trees
publishDate 2017
url http://hdl.handle.net/10356/71661
_version_ 1772829075686031360