Classification of cervical cancer using random forest
Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data,...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf http://umpir.ump.edu.my/id/eprint/36667/ https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaysia Pahang Al-Sultan Abdullah |
Language: | English |
id |
my.ump.umpir.36667 |
---|---|
record_format |
eprints |
spelling |
my.ump.umpir.366672023-02-22T09:11:35Z http://umpir.ump.edu.my/id/eprint/36667/ Classification of cervical cancer using random forest Bahirah, Mohd Bashah Ku Muhammad Naim, Ku Khalif Nor Azuana, Ramli Q Science (General) QA Mathematics RC0254 Neoplasms. Tumors. Oncology (including Cancer) Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data, data mining knowledge has capability to learn from data, identify the patterns with meaningful in that they lead to some advantages in many real-world applications. In this research, the cervical cancer risk classification model was used by using data mining approach which consider Decision Tree and Random Forest algorithm. These two models have been implemented by using JupyterLab on the UCI datasets. Model evaluation has been conducted to identify the robust data mining algorithm in the prediction of cervical cancer risk. The model gives 67% for the precision and 95% of accuracy. However, decision tree is the best method compared to Random Forest since Random Forest has the lowest AUC which indicated that it is the worse model. To improve this study, other method such as Artificial Neural Network, Support Vector Machine or ensemble classifiers can be applied to the dataset to see if there is a better model to predict cervical cancer. 2022-11-15 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf Bahirah, Mohd Bashah and Ku Muhammad Naim, Ku Khalif and Nor Azuana, Ramli (2022) Classification of cervical cancer using random forest. In: The 6th National Conference for Postgraduate Research (NCON-PGR 2022), 15 November 2022 , Virtual Conference, Universiti Malaysia Pahang, Malaysia. p. 57.. https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files |
institution |
Universiti Malaysia Pahang Al-Sultan Abdullah |
building |
UMPSA Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Pahang Al-Sultan Abdullah |
content_source |
UMPSA Institutional Repository |
url_provider |
http://umpir.ump.edu.my/ |
language |
English |
topic |
Q Science (General) QA Mathematics RC0254 Neoplasms. Tumors. Oncology (including Cancer) |
spellingShingle |
Q Science (General) QA Mathematics RC0254 Neoplasms. Tumors. Oncology (including Cancer) Bahirah, Mohd Bashah Ku Muhammad Naim, Ku Khalif Nor Azuana, Ramli Classification of cervical cancer using random forest |
description |
Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data, data mining knowledge has capability to learn from data, identify the patterns with meaningful in that they lead to some advantages in many real-world applications. In this research, the cervical cancer risk classification model was used by using data mining approach which consider Decision Tree and Random Forest algorithm. These two models have been implemented by using JupyterLab on the UCI datasets. Model evaluation has been conducted to identify the robust data mining algorithm in the prediction of cervical cancer risk. The model gives 67% for the precision and 95% of accuracy. However, decision tree is the best method compared to Random Forest since Random Forest has the lowest AUC which indicated that it is the worse model. To improve this study, other method such as Artificial Neural Network, Support Vector Machine or ensemble classifiers can be applied to the dataset to see if there is a better model to predict cervical cancer. |
format |
Conference or Workshop Item |
author |
Bahirah, Mohd Bashah Ku Muhammad Naim, Ku Khalif Nor Azuana, Ramli |
author_facet |
Bahirah, Mohd Bashah Ku Muhammad Naim, Ku Khalif Nor Azuana, Ramli |
author_sort |
Bahirah, Mohd Bashah |
title |
Classification of cervical cancer using random forest |
title_short |
Classification of cervical cancer using random forest |
title_full |
Classification of cervical cancer using random forest |
title_fullStr |
Classification of cervical cancer using random forest |
title_full_unstemmed |
Classification of cervical cancer using random forest |
title_sort |
classification of cervical cancer using random forest |
publishDate |
2022 |
url |
http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf http://umpir.ump.edu.my/id/eprint/36667/ https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files |
_version_ |
1822923243959353344 |