Accuracy and performance analysis for classification algorithms based on biomedical datasets

Diseases chronic, including heart disease, cancer, diabetes, and obesity, are the main causes of mortality in the United States and accounting for and consuming the majority of the country’s healthcare expenditure. As indicated by recent researches. The main reason for the emergence of these disease...

Full description

Saved in:
Bibliographic Details
Main Authors: Al-Hameli, Bassam Abdo, Alsewari, Abdulrahman A., Khubrani, Mousa, Fakhreldin, Mohammoud
Format: Conference or Workshop Item
Language:English
Published: IEEE 2021
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/32649/1/Accuracy_and_performance_analysis_for_classification_algorithms_based_on_biomedical_datasets.pdf
http://umpir.ump.edu.my/id/eprint/32649/
https://doi.org/10.1109/ICSECS52883.2021.00119
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang
Language: English
id my.ump.umpir.32649
record_format eprints
spelling my.ump.umpir.326492021-11-24T09:15:56Z http://umpir.ump.edu.my/id/eprint/32649/ Accuracy and performance analysis for classification algorithms based on biomedical datasets Al-Hameli, Bassam Abdo Alsewari, Abdulrahman A. Khubrani, Mousa Fakhreldin, Mohammoud QA Mathematics Diseases chronic, including heart disease, cancer, diabetes, and obesity, are the main causes of mortality in the United States and accounting for and consuming the majority of the country’s healthcare expenditure. As indicated by recent researches. The main reason for the emergence of these diseases prominently is their relationship to each other, where diabetes is one of the causes of cancer and heart disease, hepatitis also is associated with diabetes, and heart disease. This paper focuses on data mining and machine learning techniques in healthcare classification and prediction of diseases and rebuild disease detection systems (DDS). The study suggests finding a classifier among the most common kinds of classification algorithms within a combined approach represent in Bayesian, Trees, Rules, Function, and lazy algorithms to automate a better performance of early detection of diseases from the medical datasets. This paper presents and analyzes five different machine learning (ML) algorithms: Function-based Neural Network (MLP) algorithm. Trees based Decision Tree (ID3) algorithm, Bayesian Theorem based Hidden Naïve Bayes (HNB) algorithm. Lazy based k-nearest neighbors (IBK) algorithm, and Rules-based OneR algorithm. The analysis is based on four benchmark datasets in the healthcare sector, including the Pima Indian Diabetes PID, the Breast Cancer, Heart Cleveland, and Hepatitis Datasets, which were obtained from several ML repositories. The results show that the HNB predicts the best result with a relatively higher Precision, AUROC Statistic, highest accuracy, and performance when compared to MLP, IBK, OneR, ID3 algorithms. IEEE 2021 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/32649/1/Accuracy_and_performance_analysis_for_classification_algorithms_based_on_biomedical_datasets.pdf Al-Hameli, Bassam Abdo and Alsewari, Abdulrahman A. and Khubrani, Mousa and Fakhreldin, Mohammoud (2021) Accuracy and performance analysis for classification algorithms based on biomedical datasets. In: IEEE 2021 International Conference on Software Engineering & Computer Systems and 4th International Conference on Computational Science and Information Management (ICSECS-ICOCSIM), 24-26 August 2021 , Pekan, Pahang, Malaysia. pp. 620-624.. ISBN 978-1-6654-1407-4 https://doi.org/10.1109/ICSECS52883.2021.00119
institution Universiti Malaysia Pahang
building UMP Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Pahang
content_source UMP Institutional Repository
url_provider http://umpir.ump.edu.my/
language English
topic QA Mathematics
spellingShingle QA Mathematics
Al-Hameli, Bassam Abdo
Alsewari, Abdulrahman A.
Khubrani, Mousa
Fakhreldin, Mohammoud
Accuracy and performance analysis for classification algorithms based on biomedical datasets
description Diseases chronic, including heart disease, cancer, diabetes, and obesity, are the main causes of mortality in the United States and accounting for and consuming the majority of the country’s healthcare expenditure. As indicated by recent researches. The main reason for the emergence of these diseases prominently is their relationship to each other, where diabetes is one of the causes of cancer and heart disease, hepatitis also is associated with diabetes, and heart disease. This paper focuses on data mining and machine learning techniques in healthcare classification and prediction of diseases and rebuild disease detection systems (DDS). The study suggests finding a classifier among the most common kinds of classification algorithms within a combined approach represent in Bayesian, Trees, Rules, Function, and lazy algorithms to automate a better performance of early detection of diseases from the medical datasets. This paper presents and analyzes five different machine learning (ML) algorithms: Function-based Neural Network (MLP) algorithm. Trees based Decision Tree (ID3) algorithm, Bayesian Theorem based Hidden Naïve Bayes (HNB) algorithm. Lazy based k-nearest neighbors (IBK) algorithm, and Rules-based OneR algorithm. The analysis is based on four benchmark datasets in the healthcare sector, including the Pima Indian Diabetes PID, the Breast Cancer, Heart Cleveland, and Hepatitis Datasets, which were obtained from several ML repositories. The results show that the HNB predicts the best result with a relatively higher Precision, AUROC Statistic, highest accuracy, and performance when compared to MLP, IBK, OneR, ID3 algorithms.
format Conference or Workshop Item
author Al-Hameli, Bassam Abdo
Alsewari, Abdulrahman A.
Khubrani, Mousa
Fakhreldin, Mohammoud
author_facet Al-Hameli, Bassam Abdo
Alsewari, Abdulrahman A.
Khubrani, Mousa
Fakhreldin, Mohammoud
author_sort Al-Hameli, Bassam Abdo
title Accuracy and performance analysis for classification algorithms based on biomedical datasets
title_short Accuracy and performance analysis for classification algorithms based on biomedical datasets
title_full Accuracy and performance analysis for classification algorithms based on biomedical datasets
title_fullStr Accuracy and performance analysis for classification algorithms based on biomedical datasets
title_full_unstemmed Accuracy and performance analysis for classification algorithms based on biomedical datasets
title_sort accuracy and performance analysis for classification algorithms based on biomedical datasets
publisher IEEE
publishDate 2021
url http://umpir.ump.edu.my/id/eprint/32649/1/Accuracy_and_performance_analysis_for_classification_algorithms_based_on_biomedical_datasets.pdf
http://umpir.ump.edu.my/id/eprint/32649/
https://doi.org/10.1109/ICSECS52883.2021.00119
_version_ 1718926258241273856