Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program. This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2016
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/discs-faculty-pubs/16 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1015&context=discs-faculty-pubs |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.discs-faculty-pubs-1015 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.discs-faculty-pubs-10152020-02-22T02:53:41Z Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets Estuar, Ma. Regina Justina E Espina, Kennedy E Sabido, Delfin Jay, IX Lara, Raymond Josef Edward de los Reyes, Vikki Car Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program. This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be relevant in the field of epidemiology and infodemiology. Through the collection of relevant tweets, future studies may make use of the output of this research for various purposes, such as the improvement of epidemiological systems of the Department of Health in support of the eHealth strategy. In this study, we used the Naïve-Bayes classification model, an efficient text classifier, to create a model that determines whether a tweet is “infodemiological” or not. From the collected 18,044 tweets, we have narrowed it down to 1,090 tweets (6.04%) that can be used in epidemiology. Using this as a dataset for training and testing, the model was able to classify 79.91% of tweets correctly. This research shows that it is indeed feasible to collect and classify enough infodemiological tweets in the Filipino language, which in turn can be used for future infodemiological studies. 2016-01-01T08:00:00Z text application/pdf https://archium.ateneo.edu/discs-faculty-pubs/16 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1015&context=discs-faculty-pubs Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Epidemiology Infodemiology Philippines Tweets Modeling Classification Computer Sciences Databases and Information Systems Health Information Technology Social Media |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
Epidemiology Infodemiology Philippines Tweets Modeling Classification Computer Sciences Databases and Information Systems Health Information Technology Social Media |
spellingShingle |
Epidemiology Infodemiology Philippines Tweets Modeling Classification Computer Sciences Databases and Information Systems Health Information Technology Social Media Estuar, Ma. Regina Justina E Espina, Kennedy E Sabido, Delfin Jay, IX Lara, Raymond Josef Edward de los Reyes, Vikki Car Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets |
description |
Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program. This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be relevant in the field of epidemiology and infodemiology. Through the collection of relevant tweets, future studies may make use of the output of this research for various purposes, such as the improvement of epidemiological systems of the Department of Health in support of the eHealth strategy. In this study, we used the Naïve-Bayes classification model, an efficient text classifier, to create a model that determines whether a tweet is “infodemiological” or not. From the collected 18,044 tweets, we have narrowed it down to 1,090 tweets (6.04%) that can be used in epidemiology. Using this as a dataset for training and testing, the model was able to classify 79.91% of tweets correctly. This research shows that it is indeed feasible to collect and classify enough infodemiological tweets in the Filipino language, which in turn can be used for future infodemiological studies. |
format |
text |
author |
Estuar, Ma. Regina Justina E Espina, Kennedy E Sabido, Delfin Jay, IX Lara, Raymond Josef Edward de los Reyes, Vikki Car |
author_facet |
Estuar, Ma. Regina Justina E Espina, Kennedy E Sabido, Delfin Jay, IX Lara, Raymond Josef Edward de los Reyes, Vikki Car |
author_sort |
Estuar, Ma. Regina Justina E |
title |
Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets |
title_short |
Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets |
title_full |
Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets |
title_fullStr |
Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets |
title_full_unstemmed |
Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets |
title_sort |
towards an infodemiological algorithm for classification of filipino health tweets |
publisher |
Archīum Ateneo |
publishDate |
2016 |
url |
https://archium.ateneo.edu/discs-faculty-pubs/16 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1015&context=discs-faculty-pubs |
_version_ |
1722366481805606912 |