Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines
As of January 02, 2022, the Philippines is combating another surge in COVID-19 cases. With vaccinations still ongoing, the country remains vigilant and the government continues to promote compliance to minimum health standards as preventive measures to minimize the spread. Disinformation remains a c...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2022
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/discs-faculty-pubs/338 https://doi.org/10.1007/978-3-031-05061-9_27 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.discs-faculty-pubs-1338 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.discs-faculty-pubs-13382022-12-02T05:44:48Z Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines Tan, Hans Calvin L Estuar, Ma. Regina Justina Co, Nicole Allison S Tan, Austin Sebastien Abao, Roland P Aureus, Jelly P As of January 02, 2022, the Philippines is combating another surge in COVID-19 cases. With vaccinations still ongoing, the country remains vigilant and the government continues to promote compliance to minimum health standards as preventive measures to minimize the spread. Disinformation remains a challenge especially if compliance to minimum health standards and adoption of health interventions are necessary to curb the spread of COVID-19. Incorrect and unverified information about the virus increased as well which continues to run rampant in social media and with minimal models to detect disinformation in a Philippine context. The study aimed to understand the features of disinformation of COVID-19 in a Philippine context with the goal of creating a text classification model to detect disinformation of COVID-19 in social media to promote vaccine usage in the country. The usage of social network analysis was performed to understand the narratives present regarding COVID-19 disinformation. Words related to vaccines, government corruption, and government mismanagement were prevalent under the disinformation categories of “False” and “Mostly False” while words related to health information such as cases or vaccine counts were prevalent under the “Mostly True” and “True” category. Linear SVM text classification model performed the best through accuracy, precision, and recall in detecting disinformation by using TF-IDF as a feature compared to using both TF-IDF and n-grams. Disinformation narratives revolved around the idea of COVID-19 cases/vaccines, government mismanagement, and regulations. Results showed that disinformation caused distrust of the government’s management over the pandemic. Moreover, the spread of disinformation was contained to the user itself and spread to at least one other user. 2022-01-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/338 https://doi.org/10.1007/978-3-031-05061-9_27 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo COVID-19 Philippine disinformation TF-IDF N-Grams Text classification model Social network analysis Communication Communication Technology and New Media Computer Engineering Engineering Social and Behavioral Sciences Social Media |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
COVID-19 Philippine disinformation TF-IDF N-Grams Text classification model Social network analysis Communication Communication Technology and New Media Computer Engineering Engineering Social and Behavioral Sciences Social Media |
spellingShingle |
COVID-19 Philippine disinformation TF-IDF N-Grams Text classification model Social network analysis Communication Communication Technology and New Media Computer Engineering Engineering Social and Behavioral Sciences Social Media Tan, Hans Calvin L Estuar, Ma. Regina Justina Co, Nicole Allison S Tan, Austin Sebastien Abao, Roland P Aureus, Jelly P Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines |
description |
As of January 02, 2022, the Philippines is combating another surge in COVID-19 cases. With vaccinations still ongoing, the country remains vigilant and the government continues to promote compliance to minimum health standards as preventive measures to minimize the spread. Disinformation remains a challenge especially if compliance to minimum health standards and adoption of health interventions are necessary to curb the spread of COVID-19. Incorrect and unverified information about the virus increased as well which continues to run rampant in social media and with minimal models to detect disinformation in a Philippine context. The study aimed to understand the features of disinformation of COVID-19 in a Philippine context with the goal of creating a text classification model to detect disinformation of COVID-19 in social media to promote vaccine usage in the country. The usage of social network analysis was performed to understand the narratives present regarding COVID-19 disinformation. Words related to vaccines, government corruption, and government mismanagement were prevalent under the disinformation categories of “False” and “Mostly False” while words related to health information such as cases or vaccine counts were prevalent under the “Mostly True” and “True” category. Linear SVM text classification model performed the best through accuracy, precision, and recall in detecting disinformation by using TF-IDF as a feature compared to using both TF-IDF and n-grams. Disinformation narratives revolved around the idea of COVID-19 cases/vaccines, government mismanagement, and regulations. Results showed that disinformation caused distrust of the government’s management over the pandemic. Moreover, the spread of disinformation was contained to the user itself and spread to at least one other user. |
format |
text |
author |
Tan, Hans Calvin L Estuar, Ma. Regina Justina Co, Nicole Allison S Tan, Austin Sebastien Abao, Roland P Aureus, Jelly P |
author_facet |
Tan, Hans Calvin L Estuar, Ma. Regina Justina Co, Nicole Allison S Tan, Austin Sebastien Abao, Roland P Aureus, Jelly P |
author_sort |
Tan, Hans Calvin L |
title |
Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines |
title_short |
Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines |
title_full |
Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines |
title_fullStr |
Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines |
title_full_unstemmed |
Development of a Text Classification Model to Detect Disinformation About COVID-19 in Social Media: Understanding the Features and Narratives of Disinformation in the Philippines |
title_sort |
development of a text classification model to detect disinformation about covid-19 in social media: understanding the features and narratives of disinformation in the philippines |
publisher |
Archīum Ateneo |
publishDate |
2022 |
url |
https://archium.ateneo.edu/discs-faculty-pubs/338 https://doi.org/10.1007/978-3-031-05061-9_27 |
_version_ |
1751550476664438784 |