KLASIFIKASI SVM (SUPPORT VECTOR MACHINE) DAN KOMBINASI SELEKSI FITUR PADA DIAGNOSIS KANKER PAYUDARA

of breast cancer has been widely implemented using machine learning. However, in medical data analysis, breast cancer diagnosis is usually faced with high dimensional features. The high dimensional features sometimes contain irrelevant features toward the classification process. Feature selection is...

Full description

Saved in:
Bibliographic Details
Main Authors: , Elvira Sukma Wahyuni, , Noor Akhmad Setiawan, S.T., M.T., Ph.D.
Format: Theses and Dissertations NonPeerReviewed
Published: [Yogyakarta] : Universitas Gadjah Mada 2014
Subjects:
ETD
Online Access:https://repository.ugm.ac.id/133398/
http://etd.ugm.ac.id/index.php?mod=penelitian_detail&sub=PenelitianDetail&act=view&typ=html&buku_id=74052
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universitas Gadjah Mada
Description
Summary:of breast cancer has been widely implemented using machine learning. However, in medical data analysis, breast cancer diagnosis is usually faced with high dimensional features. The high dimensional features sometimes contain irrelevant features toward the classification process. Feature selection is a method to eliminate irrelevant features. It can improve the performance of diagnosis. The objective of this research is to develop a feature selection method for breast cancer diagnosis based on rough set and F-score combination. Performance of combination features selection was applied in Wisconsin Breast Cancer Dataset (WBCD). F-score feature selection method and Rough set are combined subsequently by applied Rough set firstly. Than the result of reduced subset feature by Rough set will be selected with F-score feature selection method. Improvement the performance of diagnosis would be evaluated based on the average of sensitivity, ROC AUC, accuracy, and running time with 100 times experiment. Furthermore, the results would be compared with the performance of feature selection method when it is applied individually and simultaneously. The result shows that the combination of F-score feature selection method and rough set achieves the optimal feature and superior performance compared with F-score and Rough set when applied individually. The obtained of sensitivity 0.9714, ROC AUC 0.9700, accuracy 97.05%, and the running time 0.0722 s. Keywords : Feature selection, Rough set, F-score, SVM classification, Breast cancer diagnosis