Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis
Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and mul...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/152316 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-152316 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1523162023-02-28T18:08:23Z Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis Kon, Wen Xuan Goh Wen Bin Wilson School of Biological Sciences wilsongoh@ntu.edu.sg Science::Biological sciences Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and multi-layered perceptron (MLP). Before evaluating the performance, different metrics are tested using dummy models. Accuracy and balanced accuracy are deemed unsuitable due to the large true negatives (TN) that inflates the metrics. Precision and recall are also not suitable to be used alone to determine the overall performance. The F1 and threat scores are suitable metrics that ignores the large TN. The Matthew’s corelation coefficient (MCC) is the best metric as it computes TN as part of the performance while being inert to the class imbalance. The percentage of correct predicted diagnosis may only be useful in determining whether the models are functioning. Although the 3 models did not perform well, MLP had the best performance. Some ways to improve the performance of the models include better record keeping of the TCM data, synthetic minority over sampling (SMOTE), top performing feature selection, using more complex models, and optimisation. Bachelor of Science in Biological Sciences 2021-08-02T02:19:56Z 2021-08-02T02:19:56Z 2021 Final Year Project (FYP) Kon, W. X. (2021). Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152316 https://hdl.handle.net/10356/152316 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Biological sciences |
spellingShingle |
Science::Biological sciences Kon, Wen Xuan Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis |
description |
Traditional Chinese Medicine (TCM) diagnostic features uses patient perceptions which can aid in the development of robust diagnostic tools. We explored the information value of these features by modelling the TCM diagnostic process with machine learning, namely decision tree, random forest, and multi-layered perceptron (MLP). Before evaluating the performance, different metrics are tested using dummy models. Accuracy and balanced accuracy are deemed unsuitable due to the large true negatives (TN) that inflates the metrics. Precision and recall are also not suitable to be used alone to determine the overall performance. The F1 and threat scores are suitable metrics that ignores the large TN. The Matthew’s corelation coefficient (MCC) is the best metric as it computes TN as part of the performance while being inert to the class imbalance. The percentage of correct predicted diagnosis may only be useful in determining whether the models are functioning. Although the 3 models did not perform well, MLP had the best performance. Some ways to improve the performance of the models include better record keeping of the TCM data, synthetic minority over sampling (SMOTE), top performing feature selection, using more complex models, and optimisation. |
author2 |
Goh Wen Bin Wilson |
author_facet |
Goh Wen Bin Wilson Kon, Wen Xuan |
format |
Final Year Project |
author |
Kon, Wen Xuan |
author_sort |
Kon, Wen Xuan |
title |
Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis |
title_short |
Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis |
title_full |
Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis |
title_fullStr |
Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis |
title_full_unstemmed |
Evaluating the use of different machine learning techniques to predict traditional Chinese medicine diagnosis |
title_sort |
evaluating the use of different machine learning techniques to predict traditional chinese medicine diagnosis |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/152316 |
_version_ |
1759856153757483008 |