Classification of cervical cancer using random forest

Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data,...

Full description

Saved in:
Bibliographic Details
Main Authors: Bahirah, Mohd Bashah, Ku Muhammad Naim, Ku Khalif, Nor Azuana, Ramli
Format: Conference or Workshop Item
Language:English
Published: 2022
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf
http://umpir.ump.edu.my/id/eprint/36667/
https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang Al-Sultan Abdullah
Language: English
id my.ump.umpir.36667
record_format eprints
spelling my.ump.umpir.366672023-02-22T09:11:35Z http://umpir.ump.edu.my/id/eprint/36667/ Classification of cervical cancer using random forest Bahirah, Mohd Bashah Ku Muhammad Naim, Ku Khalif Nor Azuana, Ramli Q Science (General) QA Mathematics RC0254 Neoplasms. Tumors. Oncology (including Cancer) Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data, data mining knowledge has capability to learn from data, identify the patterns with meaningful in that they lead to some advantages in many real-world applications. In this research, the cervical cancer risk classification model was used by using data mining approach which consider Decision Tree and Random Forest algorithm. These two models have been implemented by using JupyterLab on the UCI datasets. Model evaluation has been conducted to identify the robust data mining algorithm in the prediction of cervical cancer risk. The model gives 67% for the precision and 95% of accuracy. However, decision tree is the best method compared to Random Forest since Random Forest has the lowest AUC which indicated that it is the worse model. To improve this study, other method such as Artificial Neural Network, Support Vector Machine or ensemble classifiers can be applied to the dataset to see if there is a better model to predict cervical cancer. 2022-11-15 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf Bahirah, Mohd Bashah and Ku Muhammad Naim, Ku Khalif and Nor Azuana, Ramli (2022) Classification of cervical cancer using random forest. In: The 6th National Conference for Postgraduate Research (NCON-PGR 2022), 15 November 2022 , Virtual Conference, Universiti Malaysia Pahang, Malaysia. p. 57.. https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
institution Universiti Malaysia Pahang Al-Sultan Abdullah
building UMPSA Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Pahang Al-Sultan Abdullah
content_source UMPSA Institutional Repository
url_provider http://umpir.ump.edu.my/
language English
topic Q Science (General)
QA Mathematics
RC0254 Neoplasms. Tumors. Oncology (including Cancer)
spellingShingle Q Science (General)
QA Mathematics
RC0254 Neoplasms. Tumors. Oncology (including Cancer)
Bahirah, Mohd Bashah
Ku Muhammad Naim, Ku Khalif
Nor Azuana, Ramli
Classification of cervical cancer using random forest
description Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data, data mining knowledge has capability to learn from data, identify the patterns with meaningful in that they lead to some advantages in many real-world applications. In this research, the cervical cancer risk classification model was used by using data mining approach which consider Decision Tree and Random Forest algorithm. These two models have been implemented by using JupyterLab on the UCI datasets. Model evaluation has been conducted to identify the robust data mining algorithm in the prediction of cervical cancer risk. The model gives 67% for the precision and 95% of accuracy. However, decision tree is the best method compared to Random Forest since Random Forest has the lowest AUC which indicated that it is the worse model. To improve this study, other method such as Artificial Neural Network, Support Vector Machine or ensemble classifiers can be applied to the dataset to see if there is a better model to predict cervical cancer.
format Conference or Workshop Item
author Bahirah, Mohd Bashah
Ku Muhammad Naim, Ku Khalif
Nor Azuana, Ramli
author_facet Bahirah, Mohd Bashah
Ku Muhammad Naim, Ku Khalif
Nor Azuana, Ramli
author_sort Bahirah, Mohd Bashah
title Classification of cervical cancer using random forest
title_short Classification of cervical cancer using random forest
title_full Classification of cervical cancer using random forest
title_fullStr Classification of cervical cancer using random forest
title_full_unstemmed Classification of cervical cancer using random forest
title_sort classification of cervical cancer using random forest
publishDate 2022
url http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf
http://umpir.ump.edu.my/id/eprint/36667/
https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
_version_ 1822923243959353344