Breast cancer prediction: A comparative study using machine learning techniques

Early detection of disease has become a crucial problem due to rapid population growth in medical research in recent times. With the rapid population growth, the risk of death incurred by breast cancer is rising exponentially. Breast cancer is the second most severe cancer among all of the cancers a...

Full description

Saved in:
Bibliographic Details
Main Authors: Islam, Md. Milon, Haque, Md. Rezwanul, Iqbal, Hasib, Hasan, Md. Munirul, Hasan, Mahmudul, Kabir, Muhammad Nomani
Format: Article
Language:English
Published: Springer 2020
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/29306/1/Breast%20Cancer%20Prediction...A%20Comparative%20Study%20Using%20Machine.pdf
http://umpir.ump.edu.my/id/eprint/29306/
https://doi.org/10.1007/s42979-020-00305-w
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang
Language: English
Description
Summary:Early detection of disease has become a crucial problem due to rapid population growth in medical research in recent times. With the rapid population growth, the risk of death incurred by breast cancer is rising exponentially. Breast cancer is the second most severe cancer among all of the cancers already unveiled. An automatic disease detection system aids medical staffs in disease diagnosis and offers reliable, effective, and rapid response as well as decreases the risk of death. In this paper, we compare five supervised machine learning techniques named support vector machine (SVM), K-nearest neighbors, random forests, artificial neural networks (ANNs) and logistic regression. The Wisconsin Breast Cancer dataset is obtained from a prominent machine learning database named UCI machine learning database. The performance of the study is measured with respect to accuracy, sensitivity, specificity, precision, negative predictive value, false-negative rate, false-positive rate, F1 score, and Matthews Correlation Coefficient. Additionally, these techniques were appraised on precision–recall area under curve and receiver operating characteristic curve. The results reveal that the ANNs obtained the highest accuracy, precision, and F1 score of 98.57%, 97.82%, and 0.9890, respectively, whereas 97.14%, 95.65%, and 0.9777 accuracy, precision, and F1 score are obtained by SVM, respectively.