Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines

In the Philippines, one in five school-aged children are affected by undernutrition, increasing their risk of physical and cognitive development. The Department of Education (DepEd) attempts to address this issue by targeting children with low body mass index (BMI) for their school-based feeding pro...

全面介紹

Saved in:
書目詳細資料
Main Authors: Yiu, Mark Kevin A.Ong, Pastor, Carlo Gabriel M., Candano, Gabrielle Jackie C., Miro, Eden Delight, Antonio, Victor Andrew A., Go, Clark Kendrick C
格式: text
出版: Archīum Ateneo 2024
主題:
在線閱讀:https://archium.ateneo.edu/mathematics-faculty-pubs/301
https://doi.org/10.1063/5.0213404
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Ateneo De Manila University
id ph-ateneo-arc.mathematics-faculty-pubs-1304
record_format eprints
spelling ph-ateneo-arc.mathematics-faculty-pubs-13042025-05-22T06:54:15Z Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines Yiu, Mark Kevin A.Ong Pastor, Carlo Gabriel M. Candano, Gabrielle Jackie C. Miro, Eden Delight Antonio, Victor Andrew A. Go, Clark Kendrick C In the Philippines, one in five school-aged children are affected by undernutrition, increasing their risk of physical and cognitive development. The Department of Education (DepEd) attempts to address this issue by targeting children with low body mass index (BMI) for their school-based feeding program (SBFP). However, challenges like inadequate measuring tools and supervision in low-resource communities have led to large discrepancies in the nutritional status of SBFP beneficiaries and non-beneficiaries. Siy Van et al. [1] addresses the difficulties associated with BMI by using machine learning (ML) to predict undernutrition among school-aged children based on socioeconomic and demographic characteristics, dietary diversity scores, and food insecurity scores. Their study compared several ML algorithms and found that their best performing model in terms of accuracy was a random forest (RF) model. However, the RF model had high sensitivity with low specificity, indicating a bias towards the positive class. This study aims to improve these results by employing oversampling techniques and other ML algorithms that were not used in the study. Using the same data set in [1], this study compares four machine learning algorithms (RF, XGBoost, DNN, and NNRF) to predict undernutrition among school-aged children, managing imbalanced data using three oversampling techniques (SMOTE, Borderline-SMOTE, and ADASYN). Eight independent classification tasks for predicting undernutrition were performed, and results showed that a RF-Borderline model performed the best in terms of Cohen’s κ (0.3662), with an accuracy of 71.61%, sensitivity of 71.13%, and a specificity of 73.08%. While RF performed the best overall, XGBoost and NNRF performed better than RF on specific tasks. Notably, incorporating oversampling consistently enhanced model performance. 2024-07-12T07:00:00Z text https://archium.ateneo.edu/mathematics-faculty-pubs/301 https://doi.org/10.1063/5.0213404 Mathematics Faculty Publications Archīum Ateneo Applied Mathematics Data Science
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic Applied Mathematics
Data Science
spellingShingle Applied Mathematics
Data Science
Yiu, Mark Kevin A.Ong
Pastor, Carlo Gabriel M.
Candano, Gabrielle Jackie C.
Miro, Eden Delight
Antonio, Victor Andrew A.
Go, Clark Kendrick C
Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines
description In the Philippines, one in five school-aged children are affected by undernutrition, increasing their risk of physical and cognitive development. The Department of Education (DepEd) attempts to address this issue by targeting children with low body mass index (BMI) for their school-based feeding program (SBFP). However, challenges like inadequate measuring tools and supervision in low-resource communities have led to large discrepancies in the nutritional status of SBFP beneficiaries and non-beneficiaries. Siy Van et al. [1] addresses the difficulties associated with BMI by using machine learning (ML) to predict undernutrition among school-aged children based on socioeconomic and demographic characteristics, dietary diversity scores, and food insecurity scores. Their study compared several ML algorithms and found that their best performing model in terms of accuracy was a random forest (RF) model. However, the RF model had high sensitivity with low specificity, indicating a bias towards the positive class. This study aims to improve these results by employing oversampling techniques and other ML algorithms that were not used in the study. Using the same data set in [1], this study compares four machine learning algorithms (RF, XGBoost, DNN, and NNRF) to predict undernutrition among school-aged children, managing imbalanced data using three oversampling techniques (SMOTE, Borderline-SMOTE, and ADASYN). Eight independent classification tasks for predicting undernutrition were performed, and results showed that a RF-Borderline model performed the best in terms of Cohen’s κ (0.3662), with an accuracy of 71.61%, sensitivity of 71.13%, and a specificity of 73.08%. While RF performed the best overall, XGBoost and NNRF performed better than RF on specific tasks. Notably, incorporating oversampling consistently enhanced model performance.
format text
author Yiu, Mark Kevin A.Ong
Pastor, Carlo Gabriel M.
Candano, Gabrielle Jackie C.
Miro, Eden Delight
Antonio, Victor Andrew A.
Go, Clark Kendrick C
author_facet Yiu, Mark Kevin A.Ong
Pastor, Carlo Gabriel M.
Candano, Gabrielle Jackie C.
Miro, Eden Delight
Antonio, Victor Andrew A.
Go, Clark Kendrick C
author_sort Yiu, Mark Kevin A.Ong
title Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines
title_short Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines
title_full Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines
title_fullStr Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines
title_full_unstemmed Applying XGBoost, Neural Networks, and Oversampling in the Undernutrition Classification of School-Aged Children in the Philippines
title_sort applying xgboost, neural networks, and oversampling in the undernutrition classification of school-aged children in the philippines
publisher Archīum Ateneo
publishDate 2024
url https://archium.ateneo.edu/mathematics-faculty-pubs/301
https://doi.org/10.1063/5.0213404
_version_ 1833068561844666368