A comparative study on the pre-processing and mining of Pima Indian diabetes dataset
Data mining in medical data has successfully converted raw data into useful information. This information helps the medical experts in improving the diagnosis and treatment of diseases. In this paper, we review studied data mining applications applied exclusively on an open source diabetes dataset....
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/65988/1/UMP-%20A%20Comparative%20Study%20on%20the%20pre-processing%20and%20mining%20of%20prima%20Indian%20Diabetes%20Dataset.pdf http://irep.iium.edu.my/65988/ http://icsecs.ump.edu.my/index.php/en/icsecs13 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English |
id |
my.iium.irep.65988 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.659882018-09-04T09:08:40Z http://irep.iium.edu.my/65988/ A comparative study on the pre-processing and mining of Pima Indian diabetes dataset Amatul, Zehra Tuty, Asmawaty Md Aris, Mohd Aznan R Medicine (General) RZ Other systems of medicine Data mining in medical data has successfully converted raw data into useful information. This information helps the medical experts in improving the diagnosis and treatment of diseases. In this paper, we review studied data mining applications applied exclusively on an open source diabetes dataset. Type II Diabetes Mellitus is one of the silent killer diseases worldwide. According to the World Health Organization, 346 million people are suffering from diabetes worldwide. Diagnosis or prediction of diabetes is done through various data mining techniques such as association, classification, clustering and pattern recognition. The study led to the related open issues of identifying the need of a relation between the major factors that lead to the development of diabetes. This is possible by mining patterns found between the independent and dependant variables in the dataset. This paper compares the classification accuracies of non-processed and pre-processed data. The results clearly show that the pre-processed data gives better classification accuracy. 2013-08-29 Conference or Workshop Item NonPeerReviewed application/pdf en http://irep.iium.edu.my/65988/1/UMP-%20A%20Comparative%20Study%20on%20the%20pre-processing%20and%20mining%20of%20prima%20Indian%20Diabetes%20Dataset.pdf Amatul, Zehra and Tuty, Asmawaty and Md Aris, Mohd Aznan (2013) A comparative study on the pre-processing and mining of Pima Indian diabetes dataset. In: 3rd International Conference on Software Engineering & Computer Systems (ICSECS - 2013), 20th-22nd Aug. 2013, Gambang, Kuantan, Pahang. (Unpublished) http://icsecs.ump.edu.my/index.php/en/icsecs13 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English |
topic |
R Medicine (General) RZ Other systems of medicine |
spellingShingle |
R Medicine (General) RZ Other systems of medicine Amatul, Zehra Tuty, Asmawaty Md Aris, Mohd Aznan A comparative study on the pre-processing and mining of Pima Indian diabetes dataset |
description |
Data mining in medical data has successfully converted raw data into useful information. This information helps the medical experts in improving the diagnosis and treatment of diseases. In this paper, we review studied data mining applications applied exclusively on an open source diabetes dataset. Type II Diabetes Mellitus is one of the silent killer diseases worldwide. According to the World Health Organization, 346 million people are suffering from diabetes worldwide. Diagnosis or prediction of diabetes is done through various data mining techniques such as association, classification, clustering and pattern recognition. The study led to the related open issues of identifying the need of a relation between the major factors that lead to the development of diabetes. This is possible by mining patterns found between the independent and dependant variables in the dataset. This paper compares the classification accuracies of non-processed and pre-processed data. The results clearly show that the pre-processed data gives better classification accuracy. |
format |
Conference or Workshop Item |
author |
Amatul, Zehra Tuty, Asmawaty Md Aris, Mohd Aznan |
author_facet |
Amatul, Zehra Tuty, Asmawaty Md Aris, Mohd Aznan |
author_sort |
Amatul, Zehra |
title |
A comparative study on the pre-processing and mining of Pima Indian diabetes dataset |
title_short |
A comparative study on the pre-processing and mining of Pima Indian diabetes dataset |
title_full |
A comparative study on the pre-processing and mining of Pima Indian diabetes dataset |
title_fullStr |
A comparative study on the pre-processing and mining of Pima Indian diabetes dataset |
title_full_unstemmed |
A comparative study on the pre-processing and mining of Pima Indian diabetes dataset |
title_sort |
comparative study on the pre-processing and mining of pima indian diabetes dataset |
publishDate |
2013 |
url |
http://irep.iium.edu.my/65988/1/UMP-%20A%20Comparative%20Study%20on%20the%20pre-processing%20and%20mining%20of%20prima%20Indian%20Diabetes%20Dataset.pdf http://irep.iium.edu.my/65988/ http://icsecs.ump.edu.my/index.php/en/icsecs13 |
_version_ |
1643617669647171584 |