Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data

The strategy surrounding the extraction of a number of mixed variables is examined in this paper in building a model for Linear Discriminant Analysis (LDA). Two methods for extracting crucial variables from a dataset with categorical and continuous variables were employed, namely, multiple correspon...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hamid, Hashibah, Mahat, Nor Idayu, Ibrahim, Safwati
Format:	Article
Language:	English
Published:	Universiti Utara Malaysia Press 2021
Subjects:	QA Mathematics
Online Access:	https://repo.uum.edu.my/id/eprint/28777/1/JICT%2020%2003%202021%20305-327.pdf https://doi.org/10.32890/jict2021.20.3.2 https://repo.uum.edu.my/id/eprint/28777/ https://e-journal.uum.edu.my/index.php/jict/article/view/14381 https://doi.org/10.32890/jict2021.20.3.2
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Utara Malaysia
Language:	English

id	my.uum.repo.28777
record_format	eprints
spelling	my.uum.repo.287772023-05-21T15:14:03Z https://repo.uum.edu.my/id/eprint/28777/ Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data Hamid, Hashibah Mahat, Nor Idayu Ibrahim, Safwati QA Mathematics The strategy surrounding the extraction of a number of mixed variables is examined in this paper in building a model for Linear Discriminant Analysis (LDA). Two methods for extracting crucial variables from a dataset with categorical and continuous variables were employed, namely, multiple correspondence analysis (MCA) and principal component analysis (PCA). However, in this case, direct use of either MCA or PCA on mixed variables is impossible due to restrictions on the structure of data that each method could handle. Therefore, this paper executes some adjustments including a strategy for managing mixed variables so that those mixed variables are equivalent in values. With this, both MCA and PCA can be performed on mixed variables simultaneously. The variables following this strategy of extraction were then utilized in the construction of the LDA model before applying them to classify objects going forward. The suggested models, using three real sets of medical data were then tested, where the results indicated that using a combination of the two methods of MCA and PCA for extraction and LDA could reduce the model’s size, having a positive effect on classifying and better performance of the model since it leads towards minimizing the leave-one-out error rate. Accordingly, the models proposed in this paper, including the strategy that was adapted was successful in presenting good results over the full LDA model. Regarding the indicators that were used to extract and to retain the variables in the model, cumulative variance explained (CVE), eigenvalue, and a non-significant shift in the CVE (constant change), could be considered a useful reference or guideline for practitioners experiencing similar issues in future. Universiti Utara Malaysia Press 2021 Article PeerReviewed application/pdf en cc4_by https://repo.uum.edu.my/id/eprint/28777/1/JICT%2020%2003%202021%20305-327.pdf Hamid, Hashibah and Mahat, Nor Idayu and Ibrahim, Safwati (2021) Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data. Journal of Information and Communication Technology, 20 (03). pp. 305-327. ISSN 2180-3862 https://e-journal.uum.edu.my/index.php/jict/article/view/14381 https://doi.org/10.32890/jict2021.20.3.2 https://doi.org/10.32890/jict2021.20.3.2
institution	Universiti Utara Malaysia
building	UUM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Utara Malaysia
content_source	UUM Institutional Repository
url_provider	http://repo.uum.edu.my/
language	English
topic	QA Mathematics
spellingShingle	QA Mathematics Hamid, Hashibah Mahat, Nor Idayu Ibrahim, Safwati Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data
description	The strategy surrounding the extraction of a number of mixed variables is examined in this paper in building a model for Linear Discriminant Analysis (LDA). Two methods for extracting crucial variables from a dataset with categorical and continuous variables were employed, namely, multiple correspondence analysis (MCA) and principal component analysis (PCA). However, in this case, direct use of either MCA or PCA on mixed variables is impossible due to restrictions on the structure of data that each method could handle. Therefore, this paper executes some adjustments including a strategy for managing mixed variables so that those mixed variables are equivalent in values. With this, both MCA and PCA can be performed on mixed variables simultaneously. The variables following this strategy of extraction were then utilized in the construction of the LDA model before applying them to classify objects going forward. The suggested models, using three real sets of medical data were then tested, where the results indicated that using a combination of the two methods of MCA and PCA for extraction and LDA could reduce the model’s size, having a positive effect on classifying and better performance of the model since it leads towards minimizing the leave-one-out error rate. Accordingly, the models proposed in this paper, including the strategy that was adapted was successful in presenting good results over the full LDA model. Regarding the indicators that were used to extract and to retain the variables in the model, cumulative variance explained (CVE), eigenvalue, and a non-significant shift in the CVE (constant change), could be considered a useful reference or guideline for practitioners experiencing similar issues in future.
format	Article
author	Hamid, Hashibah Mahat, Nor Idayu Ibrahim, Safwati
author_facet	Hamid, Hashibah Mahat, Nor Idayu Ibrahim, Safwati
author_sort	Hamid, Hashibah
title	Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data
title_short	Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data
title_full	Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data
title_fullStr	Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data
title_full_unstemmed	Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data
title_sort	adaptive variable extractions with lda for classification of mixed variables, and applications to medical data
publisher	Universiti Utara Malaysia Press
publishDate	2021
url	https://repo.uum.edu.my/id/eprint/28777/1/JICT%2020%2003%202021%20305-327.pdf https://doi.org/10.32890/jict2021.20.3.2 https://repo.uum.edu.my/id/eprint/28777/ https://e-journal.uum.edu.my/index.php/jict/article/view/14381 https://doi.org/10.32890/jict2021.20.3.2
_version_	1768010680637587456

Adaptive Variable Extractions with LDA for Classification of Mixed Variables, and Applications to Medical Data

Similar Items