Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening

This article presents the classification of blood characteristics by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening. The aim is to classify eighteen classes of thalassaemia abnormality, which have a high prevalence in Thailand, and one control c...

Full description

Saved in:
Bibliographic Details
Main Authors: Damrongrit Setsirichok, Theera Piroonratana, Waranyu Wongseree, Touchpong Usavanarong, Nuttawut Paulkhaolarn, Chompunut Kanjanakorn, Monchan Sirikong, Chanin Limwongse, Nachol Chaiyaratana
Other Authors: King Mongkut's University of Technology North Bangkok
Format: Article
Published: 2018
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/14051
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.14051
record_format dspace
spelling th-mahidol.140512018-06-11T12:15:17Z Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening Damrongrit Setsirichok Theera Piroonratana Waranyu Wongseree Touchpong Usavanarong Nuttawut Paulkhaolarn Chompunut Kanjanakorn Monchan Sirikong Chanin Limwongse Nachol Chaiyaratana King Mongkut's University of Technology North Bangkok Mahidol University Computer Science Medicine This article presents the classification of blood characteristics by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening. The aim is to classify eighteen classes of thalassaemia abnormality, which have a high prevalence in Thailand, and one control class by inspecting data characterised by a complete blood count (CBC) and haemoglobin typing. Two indices namely a haemoglobin concentration (HB) and a mean corpuscular volume (MCV) are the chosen CBC attributes. On the other hand, known types of haemoglobin from six ranges of retention time identified via high performance liquid chromatography (HPLC) are the chosen haemoglobin typing attributes. The stratified 10-fold cross-validation results indicate that the best classification performance with average accuracy of 93.23% (standard deviation = 1.67%) and 92.60% (standard deviation = 1.75%) is achieved when the naïve Bayes classifier and the multilayer perceptron are respectively applied to samples which have been pre-processed by attribute discretisation. The results also suggest that the HB attribute is redundant. Moreover, the achieved classification performance is significantly higher than that obtained using only haemoglobin typing attributes as classifier inputs. Subsequently, the naïve Bayes classifier and the multilayer perceptron are applied to an additional data set in a clinical trial which respectively results in accuracy of 99.39% and 99.71%. These results suggest that a combination of CBC and haemoglobin typing analysis with a naïve Bayes classifier or a multilayer perceptron is highly suitable for automatic thalassaemia screening. © 2011 Elsevier Ltd. All rights reserved. 2018-06-11T04:45:38Z 2018-06-11T04:45:38Z 2012-03-01 Article Biomedical Signal Processing and Control. Vol.7, No.2 (2012), 202-212 10.1016/j.bspc.2011.03.007 17468108 17468094 2-s2.0-84857373261 https://repository.li.mahidol.ac.th/handle/123456789/14051 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84857373261&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Computer Science
Medicine
spellingShingle Computer Science
Medicine
Damrongrit Setsirichok
Theera Piroonratana
Waranyu Wongseree
Touchpong Usavanarong
Nuttawut Paulkhaolarn
Chompunut Kanjanakorn
Monchan Sirikong
Chanin Limwongse
Nachol Chaiyaratana
Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening
description This article presents the classification of blood characteristics by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening. The aim is to classify eighteen classes of thalassaemia abnormality, which have a high prevalence in Thailand, and one control class by inspecting data characterised by a complete blood count (CBC) and haemoglobin typing. Two indices namely a haemoglobin concentration (HB) and a mean corpuscular volume (MCV) are the chosen CBC attributes. On the other hand, known types of haemoglobin from six ranges of retention time identified via high performance liquid chromatography (HPLC) are the chosen haemoglobin typing attributes. The stratified 10-fold cross-validation results indicate that the best classification performance with average accuracy of 93.23% (standard deviation = 1.67%) and 92.60% (standard deviation = 1.75%) is achieved when the naïve Bayes classifier and the multilayer perceptron are respectively applied to samples which have been pre-processed by attribute discretisation. The results also suggest that the HB attribute is redundant. Moreover, the achieved classification performance is significantly higher than that obtained using only haemoglobin typing attributes as classifier inputs. Subsequently, the naïve Bayes classifier and the multilayer perceptron are applied to an additional data set in a clinical trial which respectively results in accuracy of 99.39% and 99.71%. These results suggest that a combination of CBC and haemoglobin typing analysis with a naïve Bayes classifier or a multilayer perceptron is highly suitable for automatic thalassaemia screening. © 2011 Elsevier Ltd. All rights reserved.
author2 King Mongkut's University of Technology North Bangkok
author_facet King Mongkut's University of Technology North Bangkok
Damrongrit Setsirichok
Theera Piroonratana
Waranyu Wongseree
Touchpong Usavanarong
Nuttawut Paulkhaolarn
Chompunut Kanjanakorn
Monchan Sirikong
Chanin Limwongse
Nachol Chaiyaratana
format Article
author Damrongrit Setsirichok
Theera Piroonratana
Waranyu Wongseree
Touchpong Usavanarong
Nuttawut Paulkhaolarn
Chompunut Kanjanakorn
Monchan Sirikong
Chanin Limwongse
Nachol Chaiyaratana
author_sort Damrongrit Setsirichok
title Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening
title_short Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening
title_full Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening
title_fullStr Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening
title_full_unstemmed Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naïve Bayes classifier and a multilayer perceptron for thalassaemia screening
title_sort classification of complete blood count and haemoglobin typing data by a c4.5 decision tree, a naïve bayes classifier and a multilayer perceptron for thalassaemia screening
publishDate 2018
url https://repository.li.mahidol.ac.th/handle/123456789/14051
_version_ 1763488771674734592