Achieving higher classification accuracy with ensemble of trees
Classification is a process where a classifier predicts a class label to an object using the set of inputs. One simple method to solve classification problems is a decision tree, a classifier which can be easily interpreted with a graph and yet produces potentially high accuracies. However, there is...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/71661 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-71661 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-716612023-07-07T17:22:05Z Achieving higher classification accuracy with ensemble of trees Cheng, Wen Xin Ponnuthurai Nagaratnam Suganthan School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Classification is a process where a classifier predicts a class label to an object using the set of inputs. One simple method to solve classification problems is a decision tree, a classifier which can be easily interpreted with a graph and yet produces potentially high accuracies. However, there is a limitation of this method: decision tree is unstable classifiers. This implies that a small change in the dataset results in completely different structure of the decision tree. Therefore, there is a need for ensemble methods, which can significantly improve the performance. In this project, we study on standard decision trees and their ensembles: Bootstrapped Aggregating, Random Forest, Extremely Randomised Trees, Rotation Forest, Gradient Boosting and Adaptive Boosting, and assess the performance of selected classifiers on real-world datasets. In addition, we propose a new ensemble method called Heterogeneous Ensemble of trees and compare its performance with existing methods. From the evaluation of 6 different classifiers on 10 real-world datasets, it shows that Heterogeneous Ensemble of trees outperforms other classifiers including Random Forest, the best ranked classifiers among 179 classifiers in a recent survey with 121 real-world datasets. Bachelor of Engineering 2017-05-18T06:47:53Z 2017-05-18T06:47:53Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/71661 en Nanyang Technological University 54 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Cheng, Wen Xin Achieving higher classification accuracy with ensemble of trees |
description |
Classification is a process where a classifier predicts a class label to an object using the set of inputs. One simple method to solve classification problems is a decision tree, a classifier which can be easily interpreted with a graph and yet produces potentially high accuracies. However, there is a limitation of this method: decision tree is unstable classifiers. This implies that a small change in the dataset results in completely different structure of the decision tree. Therefore, there is a need for ensemble methods, which can significantly improve the performance.
In this project, we study on standard decision trees and their ensembles: Bootstrapped Aggregating, Random Forest, Extremely Randomised Trees, Rotation Forest, Gradient Boosting and Adaptive Boosting, and assess the performance of selected classifiers on real-world datasets. In addition, we propose a new ensemble method called Heterogeneous Ensemble of trees and compare its performance with existing methods.
From the evaluation of 6 different classifiers on 10 real-world datasets, it shows that Heterogeneous Ensemble of trees outperforms other classifiers including Random Forest, the best ranked classifiers among 179 classifiers in a recent survey with 121 real-world datasets. |
author2 |
Ponnuthurai Nagaratnam Suganthan |
author_facet |
Ponnuthurai Nagaratnam Suganthan Cheng, Wen Xin |
format |
Final Year Project |
author |
Cheng, Wen Xin |
author_sort |
Cheng, Wen Xin |
title |
Achieving higher classification accuracy with ensemble of trees |
title_short |
Achieving higher classification accuracy with ensemble of trees |
title_full |
Achieving higher classification accuracy with ensemble of trees |
title_fullStr |
Achieving higher classification accuracy with ensemble of trees |
title_full_unstemmed |
Achieving higher classification accuracy with ensemble of trees |
title_sort |
achieving higher classification accuracy with ensemble of trees |
publishDate |
2017 |
url |
http://hdl.handle.net/10356/71661 |
_version_ |
1772829075686031360 |