LDSVM: Leukemia Cancer Classification Using Machine Learning

Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic services to help more pati...

Full description

Saved in:
Bibliographic Details
Main Authors: Karim, Abdul, Azhari, Azhari, Shahroz, Mobeen, Belhaouri, Samir Brahim, Mustofa, Khabib
Format: Other NonPeerReviewed
Language:English
Published: Computers, Materials and Continua 2022
Subjects:
Online Access:https://repository.ugm.ac.id/283929/1/103.LDSVM-Leukemia-cancer-classification-using-machine-learningComputers-Materials-and-Continua.pdf
https://repository.ugm.ac.id/283929/
https://www.techscience.com/cmc/v71n2/45786
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universitas Gadjah Mada
Language: English
id id-ugm-repo.283929
record_format dspace
spelling id-ugm-repo.2839292023-11-24T08:19:28Z https://repository.ugm.ac.id/283929/ LDSVM: Leukemia Cancer Classification Using Machine Learning Karim, Abdul Azhari, Azhari Shahroz, Mobeen Belhaouri, Samir Brahim Mustofa, Khabib Information and Computing Sciences Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic services to help more patients find the proposed treatment for infections. DNA microarrays are also known as biochips that consist of microscopic DNA spots attached to a solid glass surface. Currently, it is difficult to classify cancers using microarray data. Nearly many data mining techniques have failed because of the small sample size, which has become more critical for organizations. However, they are not highly effective in improving results and are frequently employed by doctors for cancer diagnosis. This study proposes a novel method using machine learning algorithms based on microarrays of leukemia GSE9476 cells. The main aim was to predict the initial leukemia disease. Machine learning algorithms such as decision tree (DT), naive bayes (NB), random forest (RF), gradient boosting machine (GBM), linear regression (LinR), support vector machine (SVM), and novel approach based on the combination of Logistic Regression (LR), DT and SVM named as ensemble LDSVM model. The k-fold cross-validation and grid search optimization methods were used with the LDSVM model to classify leukemia in patients and comparatively analyze their impacts. The proposed approach evaluated better accuracy, precision, recall, and f1 scores than the other algorithms. Furthermore, the results were relatively assessed, which showed LDSVM performance. This study aims to successfully predict leukemia in patients and enhance prediction accuracy in minimum time. Moreover, a Synthetic minority oversampling technique (SMOTE) and Principal compenent analysis (PCA) approaches were implemented. This makes the records generalized and evaluates the outcomeswell.PCAreduces the feature count without losing any information and deals with class imbalanced datasets, as well as faster model execution along with less computation cost. In this study, a novel process was used to reduce the column results to develop a faster and more rapid experiment execution. Computers, Materials and Continua 2022 Other NonPeerReviewed application/pdf en https://repository.ugm.ac.id/283929/1/103.LDSVM-Leukemia-cancer-classification-using-machine-learningComputers-Materials-and-Continua.pdf Karim, Abdul and Azhari, Azhari and Shahroz, Mobeen and Belhaouri, Samir Brahim and Mustofa, Khabib (2022) LDSVM: Leukemia Cancer Classification Using Machine Learning. Computers, Materials and Continua. https://www.techscience.com/cmc/v71n2/45786 DOI:10.32604/cmc.2022.021218
institution Universitas Gadjah Mada
building UGM Library
continent Asia
country Indonesia
Indonesia
content_provider UGM Library
collection Repository Civitas UGM
language English
topic Information and Computing Sciences
spellingShingle Information and Computing Sciences
Karim, Abdul
Azhari, Azhari
Shahroz, Mobeen
Belhaouri, Samir Brahim
Mustofa, Khabib
LDSVM: Leukemia Cancer Classification Using Machine Learning
description Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic services to help more patients find the proposed treatment for infections. DNA microarrays are also known as biochips that consist of microscopic DNA spots attached to a solid glass surface. Currently, it is difficult to classify cancers using microarray data. Nearly many data mining techniques have failed because of the small sample size, which has become more critical for organizations. However, they are not highly effective in improving results and are frequently employed by doctors for cancer diagnosis. This study proposes a novel method using machine learning algorithms based on microarrays of leukemia GSE9476 cells. The main aim was to predict the initial leukemia disease. Machine learning algorithms such as decision tree (DT), naive bayes (NB), random forest (RF), gradient boosting machine (GBM), linear regression (LinR), support vector machine (SVM), and novel approach based on the combination of Logistic Regression (LR), DT and SVM named as ensemble LDSVM model. The k-fold cross-validation and grid search optimization methods were used with the LDSVM model to classify leukemia in patients and comparatively analyze their impacts. The proposed approach evaluated better accuracy, precision, recall, and f1 scores than the other algorithms. Furthermore, the results were relatively assessed, which showed LDSVM performance. This study aims to successfully predict leukemia in patients and enhance prediction accuracy in minimum time. Moreover, a Synthetic minority oversampling technique (SMOTE) and Principal compenent analysis (PCA) approaches were implemented. This makes the records generalized and evaluates the outcomeswell.PCAreduces the feature count without losing any information and deals with class imbalanced datasets, as well as faster model execution along with less computation cost. In this study, a novel process was used to reduce the column results to develop a faster and more rapid experiment execution.
format Other
NonPeerReviewed
author Karim, Abdul
Azhari, Azhari
Shahroz, Mobeen
Belhaouri, Samir Brahim
Mustofa, Khabib
author_facet Karim, Abdul
Azhari, Azhari
Shahroz, Mobeen
Belhaouri, Samir Brahim
Mustofa, Khabib
author_sort Karim, Abdul
title LDSVM: Leukemia Cancer Classification Using Machine Learning
title_short LDSVM: Leukemia Cancer Classification Using Machine Learning
title_full LDSVM: Leukemia Cancer Classification Using Machine Learning
title_fullStr LDSVM: Leukemia Cancer Classification Using Machine Learning
title_full_unstemmed LDSVM: Leukemia Cancer Classification Using Machine Learning
title_sort ldsvm: leukemia cancer classification using machine learning
publisher Computers, Materials and Continua
publishDate 2022
url https://repository.ugm.ac.id/283929/1/103.LDSVM-Leukemia-cancer-classification-using-machine-learningComputers-Materials-and-Continua.pdf
https://repository.ugm.ac.id/283929/
https://www.techscience.com/cmc/v71n2/45786
_version_ 1783956525183664128