Predicting Accuracy of Income a Year Using Rough Set Theory
The main objective of the experiments is to predict the accuracy of Adult dataset whether the income exceeds $50K per year or below $50K. Specifically, the objectives are to determine the best discretization method, split factor, reduction method, classifier and to build the classification model. In...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English English |
Published: |
2009
|
Subjects: | |
Online Access: | http://etd.uum.edu.my/2066/1/Zuraihah_Ngadengon.pdf http://etd.uum.edu.my/2066/2/1.Zuraihah_Ngadengon.pdf http://etd.uum.edu.my/2066/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Utara Malaysia |
Language: | English English |
id |
my.uum.etd.2066 |
---|---|
record_format |
eprints |
spelling |
my.uum.etd.20662013-07-24T12:14:14Z http://etd.uum.edu.my/2066/ Predicting Accuracy of Income a Year Using Rough Set Theory Zuraihah, Ngadengon QA273-280 Probabilities. Mathematical statistics The main objective of the experiments is to predict the accuracy of Adult dataset whether the income exceeds $50K per year or below $50K. Specifically, the objectives are to determine the best discretization method, split factor, reduction method, classifier and to build the classification model. In the experiments, the prediction of accuracy of the Adult dataset is developed by using rough set theory and Rosetta software while Knowledge Data Discovery (KDD) is used as the methodology. The Adult dataset that had been used in the experiments is comprises of 48,842 instances but only 24,999 instances is used along the experiments. Then, the data was randomly split into training data and testing data by using nine splits factor, which are 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 and 0.9. The result obtained from the experiments showed that the best discretization method is Naive Algorithm, the best split factor is 0.6, the best reduction method is Johnson's Algorithm and the best classifier is Standard Voting. The highest percentage of accuracy achieved by the classification model developed using the rough set theory is 87.12%. The experiments showed that rough set theory is a useful approach to analyze the Adult dataset because the accuracy achieved in the experiments exceeds the previous methods that have been used before. 2009 Thesis NonPeerReviewed application/pdf en http://etd.uum.edu.my/2066/1/Zuraihah_Ngadengon.pdf application/pdf en http://etd.uum.edu.my/2066/2/1.Zuraihah_Ngadengon.pdf Zuraihah, Ngadengon (2009) Predicting Accuracy of Income a Year Using Rough Set Theory. Masters thesis, Universiti Utara Malaysia. |
institution |
Universiti Utara Malaysia |
building |
UUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Utara Malaysia |
content_source |
UUM Electronic Theses |
url_provider |
http://etd.uum.edu.my/ |
language |
English English |
topic |
QA273-280 Probabilities. Mathematical statistics |
spellingShingle |
QA273-280 Probabilities. Mathematical statistics Zuraihah, Ngadengon Predicting Accuracy of Income a Year Using Rough Set Theory |
description |
The main objective of the experiments is to predict the accuracy of Adult dataset whether the income exceeds $50K per year or below $50K. Specifically, the objectives are to determine the best discretization method, split factor, reduction method, classifier and to build the classification model. In the experiments, the prediction of accuracy of the Adult dataset is developed by using rough set theory and Rosetta software while Knowledge Data Discovery (KDD) is used as the methodology. The Adult dataset that had been used in the experiments is comprises of 48,842 instances but only 24,999 instances is used along the experiments. Then, the data was randomly split into training data and testing data by using nine splits factor, which are 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 and 0.9. The result obtained from the experiments showed that the best discretization method is Naive Algorithm, the best split factor is 0.6, the best reduction method is
Johnson's Algorithm and the best classifier is Standard Voting. The highest percentage of accuracy achieved by the classification model developed using the rough set theory is
87.12%. The experiments showed that rough set theory is a useful approach to analyze the Adult dataset because the accuracy achieved in the experiments exceeds the previous
methods that have been used before. |
format |
Thesis |
author |
Zuraihah, Ngadengon |
author_facet |
Zuraihah, Ngadengon |
author_sort |
Zuraihah, Ngadengon |
title |
Predicting Accuracy of Income a Year Using Rough Set Theory |
title_short |
Predicting Accuracy of Income a Year Using Rough Set Theory |
title_full |
Predicting Accuracy of Income a Year Using Rough Set Theory |
title_fullStr |
Predicting Accuracy of Income a Year Using Rough Set Theory |
title_full_unstemmed |
Predicting Accuracy of Income a Year Using Rough Set Theory |
title_sort |
predicting accuracy of income a year using rough set theory |
publishDate |
2009 |
url |
http://etd.uum.edu.my/2066/1/Zuraihah_Ngadengon.pdf http://etd.uum.edu.my/2066/2/1.Zuraihah_Ngadengon.pdf http://etd.uum.edu.my/2066/ |
_version_ |
1644276580748361728 |