Dealing with missing values for effective prediction of NPC recurrence

This paper aims to investigate missing data techniques for effective prediction of nasopharyngeal carcinoma (NPC) recurrence. The techniques include listwise deletion, imputations by mean, k-nearest neighbor, and expectation maximization. The completed data are used to predict the presence or absenc...

Full description

Saved in:
Bibliographic Details
Main Authors: Orrawan Kumdee, Panrasee Ritthipravat, Thongchai Bhongmakapat, Wichit Cheewaruangroj
Other Authors: Mahidol University
Format: Conference or Workshop Item
Published: 2018
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/19118
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.19118
record_format dspace
spelling th-mahidol.191182018-07-12T09:25:57Z Dealing with missing values for effective prediction of NPC recurrence Orrawan Kumdee Panrasee Ritthipravat Thongchai Bhongmakapat Wichit Cheewaruangroj Mahidol University Faculty of Medicine, Ramathibodi Hospital, Mahidol University Computer Science Engineering This paper aims to investigate missing data techniques for effective prediction of nasopharyngeal carcinoma (NPC) recurrence. The techniques include listwise deletion, imputations by mean, k-nearest neighbor, and expectation maximization. The completed data are used to predict the presence or absence of NPC recurrence in each year by means of logistic regression, multilayer perceptron with backpropagation training, and naïve bayes. Five year predictions are carried out. Validity of each predictive model is assured by 10-fold cross validation. Their results are compared in order to determine proper missing data treatment and the most efficient prediction technique. The results showed that EM imputation was superior to the other missing data techniques because it can be efficiently applied to all predictive models. The multilayer perceptron with backpropagation training gave the highest prediction performance and it was the most robust to the data completed by different missing data techniques. © 2008 SICE. 2018-07-12T02:24:11Z 2018-07-12T02:24:11Z 2008-12-01 Conference Paper Proceedings of the SICE Annual Conference. (2008), 1290-1294 10.1109/SICE.2008.4654856 2-s2.0-56749177788 https://repository.li.mahidol.ac.th/handle/123456789/19118 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=56749177788&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Computer Science
Engineering
spellingShingle Computer Science
Engineering
Orrawan Kumdee
Panrasee Ritthipravat
Thongchai Bhongmakapat
Wichit Cheewaruangroj
Dealing with missing values for effective prediction of NPC recurrence
description This paper aims to investigate missing data techniques for effective prediction of nasopharyngeal carcinoma (NPC) recurrence. The techniques include listwise deletion, imputations by mean, k-nearest neighbor, and expectation maximization. The completed data are used to predict the presence or absence of NPC recurrence in each year by means of logistic regression, multilayer perceptron with backpropagation training, and naïve bayes. Five year predictions are carried out. Validity of each predictive model is assured by 10-fold cross validation. Their results are compared in order to determine proper missing data treatment and the most efficient prediction technique. The results showed that EM imputation was superior to the other missing data techniques because it can be efficiently applied to all predictive models. The multilayer perceptron with backpropagation training gave the highest prediction performance and it was the most robust to the data completed by different missing data techniques. © 2008 SICE.
author2 Mahidol University
author_facet Mahidol University
Orrawan Kumdee
Panrasee Ritthipravat
Thongchai Bhongmakapat
Wichit Cheewaruangroj
format Conference or Workshop Item
author Orrawan Kumdee
Panrasee Ritthipravat
Thongchai Bhongmakapat
Wichit Cheewaruangroj
author_sort Orrawan Kumdee
title Dealing with missing values for effective prediction of NPC recurrence
title_short Dealing with missing values for effective prediction of NPC recurrence
title_full Dealing with missing values for effective prediction of NPC recurrence
title_fullStr Dealing with missing values for effective prediction of NPC recurrence
title_full_unstemmed Dealing with missing values for effective prediction of NPC recurrence
title_sort dealing with missing values for effective prediction of npc recurrence
publishDate 2018
url https://repository.li.mahidol.ac.th/handle/123456789/19118
_version_ 1763491511263035392