Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis

Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and mul...

Full description

Saved in:
Bibliographic Details
Main Author: Kon, Wen Xuan
Other Authors: Goh Wen Bin Wilson
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/152316
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-152316
record_format dspace
spelling sg-ntu-dr.10356-1523162023-02-28T18:08:23Z Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis Kon, Wen Xuan Goh Wen Bin Wilson School of Biological Sciences wilsongoh@ntu.edu.sg Science::Biological sciences Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and multi-layered perceptron (MLP). Before evaluating the performance, different metrics are tested using dummy models. Accuracy and balanced accuracy are deemed unsuitable due to the large true negatives (TN) that inflates the metrics. Precision and recall are also not suitable to be used alone to determine the overall performance. The F1 and threat scores are suitable metrics that ignores the large TN. The Matthew’s corelation coefficient (MCC) is the best metric as it computes TN as part of the performance while being inert to the class imbalance. The percentage of correct predicted diagnosis may only be useful in determining whether the models are functioning. Although the 3 models did not perform well, MLP had the best performance. Some ways to improve the performance of the models include better record keeping of the TCM data, synthetic minority over sampling (SMOTE), top performing feature selection, using more complex models, and optimisation. Bachelor of Science in Biological Sciences 2021-08-02T02:19:56Z 2021-08-02T02:19:56Z 2021 Final Year Project (FYP) Kon, W. X. (2021). Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152316 https://hdl.handle.net/10356/152316 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
spellingShingle Science::Biological sciences
Kon, Wen Xuan
Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
description Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and multi-layered perceptron (MLP). Before evaluating the performance, different metrics are tested using dummy models. Accuracy and balanced accuracy are deemed unsuitable due to the large true negatives (TN) that inflates the metrics. Precision and recall are also not suitable to be used alone to determine the overall performance. The F1 and threat scores are suitable metrics that ignores the large TN. The Matthew’s corelation coefficient (MCC) is the best metric as it computes TN as part of the performance while being inert to the class imbalance. The percentage of correct predicted diagnosis may only be useful in determining whether the models are functioning. Although the 3 models did not perform well, MLP had the best performance. Some ways to improve the performance of the models include better record keeping of the TCM data, synthetic minority over sampling (SMOTE), top performing feature selection, using more complex models, and optimisation.
author2 Goh Wen Bin Wilson
author_facet Goh Wen Bin Wilson
Kon, Wen Xuan
format Final Year Project
author Kon, Wen Xuan
author_sort Kon, Wen Xuan
title Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_short Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_full Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_fullStr Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_full_unstemmed Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
title_sort evaluating the use of different machine learning techniques to predict traditional chinese medicine diagnosis
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/152316
_version_ 1759856153757483008