Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence

© 2017 Elsevier B.V. Ovarian cancer is the second leading cause of deaths among gynecologic cancers in the world. Approximately 90% of women with ovarian cancer reported having symptoms long before a diagnosis was made. Literature shows that recurrence should be predicted with regard to their person...

Full description

Saved in:
Bibliographic Details
Main Authors: Chih Jen Tseng, Chi Jie Lu, Chi Chang Chang, Gin Den Chen, Chalong Cheewakriangkrai
Format: Journal
Published: 2018
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85020746721&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/46662
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-46662
record_format dspace
spelling th-cmuir.6653943832-466622018-04-25T07:24:51Z Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence Chih Jen Tseng Chi Jie Lu Chi Chang Chang Gin Den Chen Chalong Cheewakriangkrai Agricultural and Biological Sciences © 2017 Elsevier B.V. Ovarian cancer is the second leading cause of deaths among gynecologic cancers in the world. Approximately 90% of women with ovarian cancer reported having symptoms long before a diagnosis was made. Literature shows that recurrence should be predicted with regard to their personal risk factors and the clinical symptoms of this devastating cancer. In this study, ensemble learning and five data mining approaches, including support vector machine (SVM), C5.0, extreme learning machine (ELM), multivariate adaptive regression splines (MARS), and random forest (RF), were integrated to rank the importance of risk factors and diagnose the recurrence of ovarian cancer. The medical records and pathologic status were extracted from the Chung Shan Medical University Hospital Tumor Registry. Experimental results illustrated that the integrated C5.0 model is a superior approach in predicting the recurrence of ovarian cancer. Moreover, the classification accuracies of C5.0, ELM, MARS, RF, and SVM indeed increased after using the selected important risk factors as predictors. Our findings suggest that The International Federation of Gynecology and Obstetrics (FIGO), Pathologic M, Age, and Pathologic T were the four most critical risk factors for ovarian cancer recurrence. In summary, the above information can support the important influence of personality and clinical symptom representations on all p hases of guide interventions, with the complexities of multiple symptoms associated with ovarian cancer in all phases of the recurrent trajectory. 2018-04-25T06:59:06Z 2018-04-25T06:59:06Z 2017-05-01 Journal 18732860 09333657 2-s2.0-85020746721 10.1016/j.artmed.2017.06.003 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85020746721&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/46662
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Agricultural and Biological Sciences
spellingShingle Agricultural and Biological Sciences
Chih Jen Tseng
Chi Jie Lu
Chi Chang Chang
Gin Den Chen
Chalong Cheewakriangkrai
Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
description © 2017 Elsevier B.V. Ovarian cancer is the second leading cause of deaths among gynecologic cancers in the world. Approximately 90% of women with ovarian cancer reported having symptoms long before a diagnosis was made. Literature shows that recurrence should be predicted with regard to their personal risk factors and the clinical symptoms of this devastating cancer. In this study, ensemble learning and five data mining approaches, including support vector machine (SVM), C5.0, extreme learning machine (ELM), multivariate adaptive regression splines (MARS), and random forest (RF), were integrated to rank the importance of risk factors and diagnose the recurrence of ovarian cancer. The medical records and pathologic status were extracted from the Chung Shan Medical University Hospital Tumor Registry. Experimental results illustrated that the integrated C5.0 model is a superior approach in predicting the recurrence of ovarian cancer. Moreover, the classification accuracies of C5.0, ELM, MARS, RF, and SVM indeed increased after using the selected important risk factors as predictors. Our findings suggest that The International Federation of Gynecology and Obstetrics (FIGO), Pathologic M, Age, and Pathologic T were the four most critical risk factors for ovarian cancer recurrence. In summary, the above information can support the important influence of personality and clinical symptom representations on all p hases of guide interventions, with the complexities of multiple symptoms associated with ovarian cancer in all phases of the recurrent trajectory.
format Journal
author Chih Jen Tseng
Chi Jie Lu
Chi Chang Chang
Gin Den Chen
Chalong Cheewakriangkrai
author_facet Chih Jen Tseng
Chi Jie Lu
Chi Chang Chang
Gin Den Chen
Chalong Cheewakriangkrai
author_sort Chih Jen Tseng
title Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
title_short Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
title_full Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
title_fullStr Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
title_full_unstemmed Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
title_sort integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence
publishDate 2018
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85020746721&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/46662
_version_ 1681422916047077376