Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis

Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and mul...

Full description

Saved in:

Bibliographic Details
Main Author:	Kon, Wen Xuan
Other Authors:	Goh Wen Bin Wilson
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2021
Subjects:	Science::Biological sciences
Online Access:	https://hdl.handle.net/10356/152316
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-152316
record_format	dspace
spelling	sg-ntu-dr.10356-1523162023-02-28T18:08:23Z Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis Kon, Wen Xuan Goh Wen Bin Wilson School of Biological Sciences wilsongoh@ntu.edu.sg Science::Biological sciences Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and multi-layered perceptron (MLP). Before evaluating the performance, different metrics are tested using dummy models. Accuracy and balanced accuracy are deemed unsuitable due to the large true negatives (TN) that inflates the metrics. Precision and recall are also not suitable to be used alone to determine the overall performance. The F1 and threat scores are suitable metrics that ignores the large TN. The Matthew’s corelation coefficient (MCC) is the best metric as it computes TN as part of the performance while being inert to the class imbalance. The percentage of correct predicted diagnosis may only be useful in determining whether the models are functioning. Although the 3 models did not perform well, MLP had the best performance. Some ways to improve the performance of the models include better record keeping of the TCM data, synthetic minority over sampling (SMOTE), top performing feature selection, using more complex models, and optimisation. Bachelor of Science in Biological Sciences 2021-08-02T02:19:56Z 2021-08-02T02:19:56Z 2021 Final Year Project (FYP) Kon, W. X. (2021). Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152316 https://hdl.handle.net/10356/152316 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Science::Biological sciences
spellingShingle	Science::Biological sciences Kon, Wen Xuan Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
description	Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and multi-layered perceptron (MLP). Before evaluating the performance, different metrics are tested using dummy models. Accuracy and balanced accuracy are deemed unsuitable due to the large true negatives (TN) that inflates the metrics. Precision and recall are also not suitable to be used alone to determine the overall performance. The F1 and threat scores are suitable metrics that ignores the large TN. The Matthew’s corelation coefficient (MCC) is the best metric as it computes TN as part of the performance while being inert to the class imbalance. The percentage of correct predicted diagnosis may only be useful in determining whether the models are functioning. Although the 3 models did not perform well, MLP had the best performance. Some ways to improve the performance of the models include better record keeping of the TCM data, synthetic minority over sampling (SMOTE), top performing feature selection, using more complex models, and optimisation.
author2	Goh Wen Bin Wilson
author_facet	Goh Wen Bin Wilson Kon, Wen Xuan
format	Final Year Project
author	Kon, Wen Xuan
author_sort	Kon, Wen Xuan
title	Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_short	Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_full	Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_fullStr	Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_full_unstemmed	Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_sort	evaluating the use of different machine learning techniques to predict traditional chinese medicine diagnosis
publisher	Nanyang Technological University
publishDate	2021
url	https://hdl.handle.net/10356/152316
_version_	1759856153757483008

Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis

Similar Items