Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets

Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program. This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be...

Full description

Saved in:
Bibliographic Details
Main Authors: Estuar, Ma. Regina Justina E, Espina, Kennedy E, Sabido, Delfin Jay, IX, Lara, Raymond Josef Edward, de los Reyes, Vikki Car
Format: text
Published: Archīum Ateneo 2016
Subjects:
Online Access:https://archium.ateneo.edu/discs-faculty-pubs/16
https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1015&context=discs-faculty-pubs
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.discs-faculty-pubs-1015
record_format eprints
spelling ph-ateneo-arc.discs-faculty-pubs-10152020-02-22T02:53:41Z Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets Estuar, Ma. Regina Justina E Espina, Kennedy E Sabido, Delfin Jay, IX Lara, Raymond Josef Edward de los Reyes, Vikki Car Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program. This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be relevant in the field of epidemiology and infodemiology. Through the collection of relevant tweets, future studies may make use of the output of this research for various purposes, such as the improvement of epidemiological systems of the Department of Health in support of the eHealth strategy. In this study, we used the Naïve-Bayes classification model, an efficient text classifier, to create a model that determines whether a tweet is “infodemiological” or not. From the collected 18,044 tweets, we have narrowed it down to 1,090 tweets (6.04%) that can be used in epidemiology. Using this as a dataset for training and testing, the model was able to classify 79.91% of tweets correctly. This research shows that it is indeed feasible to collect and classify enough infodemiological tweets in the Filipino language, which in turn can be used for future infodemiological studies. 2016-01-01T08:00:00Z text application/pdf https://archium.ateneo.edu/discs-faculty-pubs/16 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1015&context=discs-faculty-pubs Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Epidemiology Infodemiology Philippines Tweets Modeling Classification Computer Sciences Databases and Information Systems Health Information Technology Social Media
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic Epidemiology
Infodemiology
Philippines
Tweets
Modeling
Classification
Computer Sciences
Databases and Information Systems
Health Information Technology
Social Media
spellingShingle Epidemiology
Infodemiology
Philippines
Tweets
Modeling
Classification
Computer Sciences
Databases and Information Systems
Health Information Technology
Social Media
Estuar, Ma. Regina Justina E
Espina, Kennedy E
Sabido, Delfin Jay, IX
Lara, Raymond Josef Edward
de los Reyes, Vikki Car
Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
description Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program. This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be relevant in the field of epidemiology and infodemiology. Through the collection of relevant tweets, future studies may make use of the output of this research for various purposes, such as the improvement of epidemiological systems of the Department of Health in support of the eHealth strategy. In this study, we used the Naïve-Bayes classification model, an efficient text classifier, to create a model that determines whether a tweet is “infodemiological” or not. From the collected 18,044 tweets, we have narrowed it down to 1,090 tweets (6.04%) that can be used in epidemiology. Using this as a dataset for training and testing, the model was able to classify 79.91% of tweets correctly. This research shows that it is indeed feasible to collect and classify enough infodemiological tweets in the Filipino language, which in turn can be used for future infodemiological studies.
format text
author Estuar, Ma. Regina Justina E
Espina, Kennedy E
Sabido, Delfin Jay, IX
Lara, Raymond Josef Edward
de los Reyes, Vikki Car
author_facet Estuar, Ma. Regina Justina E
Espina, Kennedy E
Sabido, Delfin Jay, IX
Lara, Raymond Josef Edward
de los Reyes, Vikki Car
author_sort Estuar, Ma. Regina Justina E
title Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
title_short Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
title_full Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
title_fullStr Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
title_full_unstemmed Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
title_sort towards an infodemiological algorithm for classification of filipino health tweets
publisher Archīum Ateneo
publishDate 2016
url https://archium.ateneo.edu/discs-faculty-pubs/16
https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1015&context=discs-faculty-pubs
_version_ 1722366481805606912