Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing

Women face a greater risk of contracting HIV due to their anatomy and the impacts of gender inequality. Despite this, only 8% of Filipino women have ever tested for HIV, according to the results of the 2022 National Demographic Health Survey. This study aimed to identify the determinants of HIV test...

Full description

Saved in:
Bibliographic Details
Main Authors: Arriola, Andrei Joseph, Salting, Francesca Marie Orolfo
Format: text
Language:English
Published: Animo Repository 2024
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etdb_math/39
https://animorepository.dlsu.edu.ph/context/etdb_math/article/1042/viewcontent/2024_Arriola_Salting_Determinants_predicting_HIV_testing_among_Filipino_women_using_ma.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etdb_math-1042
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etdb_math-10422024-08-20T07:32:03Z Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing Arriola, Andrei Joseph Salting, Francesca Marie Orolfo Women face a greater risk of contracting HIV due to their anatomy and the impacts of gender inequality. Despite this, only 8% of Filipino women have ever tested for HIV, according to the results of the 2022 National Demographic Health Survey. This study aimed to identify the determinants of HIV testing to aid in the development of policies and interventions that could improve testing uptake. Relevant factors from stepwise selection were used to predict HIV testing using logistic regression, random forest, and Naïve Bayes classifiers. Since the target class was highly imbalanced, Synthetic Minority Oversampling Technique (SMOTE) preprocessing was implemented before machine learning classification. The classifiers were then evaluated using five-fold cross-validation, and performance metrics precision, recall, and F1 score were computed from resulting confusion matrices. Age, region, residence type, educational attainment, print media use, internet use, wealth, contraceptive use and intention, marital status, age at first sex, and some partner characteristics were found to be significant determinants of HIV testing among Filipino women. Lower rates of HIV testing were associated with older respondents, those from rural households, those with a lower educational attainment, and those with a lower socioeconomic status. Among the three classifiers, random forest and logistic regression performed better with and without SMOTE, respectively. SMOTE preprocessing did not result in any substantial improvements to the logistic regression classifier. As for the two nonparametric machine learning classifiers, random forest, and Näive Bayes, SMOTE yielded higher F1 scores, with higher recall scores coming at the cost of lower precision scores. 2024-08-10T07:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdb_math/39 https://animorepository.dlsu.edu.ph/context/etdb_math/article/1042/viewcontent/2024_Arriola_Salting_Determinants_predicting_HIV_testing_among_Filipino_women_using_ma.pdf Mathematics and Statistics Bachelor's Theses English Animo Repository HIV (Viruses)—Testing Women—Health and hygiene--Philippines Logistic regression analysis Mathematics Public Health
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic HIV (Viruses)—Testing
Women—Health and hygiene--Philippines
Logistic regression analysis
Mathematics
Public Health
spellingShingle HIV (Viruses)—Testing
Women—Health and hygiene--Philippines
Logistic regression analysis
Mathematics
Public Health
Arriola, Andrei Joseph
Salting, Francesca Marie Orolfo
Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing
description Women face a greater risk of contracting HIV due to their anatomy and the impacts of gender inequality. Despite this, only 8% of Filipino women have ever tested for HIV, according to the results of the 2022 National Demographic Health Survey. This study aimed to identify the determinants of HIV testing to aid in the development of policies and interventions that could improve testing uptake. Relevant factors from stepwise selection were used to predict HIV testing using logistic regression, random forest, and Naïve Bayes classifiers. Since the target class was highly imbalanced, Synthetic Minority Oversampling Technique (SMOTE) preprocessing was implemented before machine learning classification. The classifiers were then evaluated using five-fold cross-validation, and performance metrics precision, recall, and F1 score were computed from resulting confusion matrices. Age, region, residence type, educational attainment, print media use, internet use, wealth, contraceptive use and intention, marital status, age at first sex, and some partner characteristics were found to be significant determinants of HIV testing among Filipino women. Lower rates of HIV testing were associated with older respondents, those from rural households, those with a lower educational attainment, and those with a lower socioeconomic status. Among the three classifiers, random forest and logistic regression performed better with and without SMOTE, respectively. SMOTE preprocessing did not result in any substantial improvements to the logistic regression classifier. As for the two nonparametric machine learning classifiers, random forest, and Näive Bayes, SMOTE yielded higher F1 scores, with higher recall scores coming at the cost of lower precision scores.
format text
author Arriola, Andrei Joseph
Salting, Francesca Marie Orolfo
author_facet Arriola, Andrei Joseph
Salting, Francesca Marie Orolfo
author_sort Arriola, Andrei Joseph
title Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing
title_short Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing
title_full Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing
title_fullStr Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing
title_full_unstemmed Determinants predicting HIV testing among Filipino women using machine learning models with SMOTE preprocessing
title_sort determinants predicting hiv testing among filipino women using machine learning models with smote preprocessing
publisher Animo Repository
publishDate 2024
url https://animorepository.dlsu.edu.ph/etdb_math/39
https://animorepository.dlsu.edu.ph/context/etdb_math/article/1042/viewcontent/2024_Arriola_Salting_Determinants_predicting_HIV_testing_among_Filipino_women_using_ma.pdf
_version_ 1808617076472938496