#Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines
In this study, we aim to prove that natural language processing (NLP) can be used as a method for analyzing qualitative data like tweets. To validate this, we examine various topics emerging from tweets about class suspensions in the Philippines using #walangpasok (no classes). By utilizing three NL...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Published: |
Animo Repository
2020
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/faculty_research/424 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1423/type/native/viewcontent |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
id |
oai:animorepository.dlsu.edu.ph:faculty_research-1423 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:faculty_research-14232021-11-23T06:27:40Z #Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines Ancheta, Jeffrey Rosario Gorro, Ken D. Uy, Mark Anthony D. In this study, we aim to prove that natural language processing (NLP) can be used as a method for analyzing qualitative data like tweets. To validate this, we examine various topics emerging from tweets about class suspensions in the Philippines using #walangpasok (no classes). By utilizing three NLP techniques, we presented various topics. Using Latent Dirichlet Allocation (LDA), the study identified the following: (1) information dissemination and public announcements from government offices, and (2) sentiments of students. Through Word2Vec, we generated (1) monitoring and dissemination of alerts and warnings, (2) local government accountability, and (3) sentiments of Twitter users. An intrinsic evaluation was conducted for word2vec model using cosine similarity. The word-groups have an average cosine similarity of 0.997811. Lastly, the topics that emerged using K-means are (1) weather advisory, (2) class suspension announcements, (3) role of local government, and (4) users' sentiments and outlook towards a situation. The number of clusters generated by the k-means clustering algorithm was decided based on the silhouette score of 0.0153423865462. It turns out that the study provided the same results using NLP techniques. We also performed open coding to analyze the data manually and to ensure that the obtained results using the techniques are accurate. Thus, the use of NLP as a method of qualitative data analysis can be considered reliable and may be recommended to use in other qualitative research, particularly in the field of social sciences. © 2020 IEEE. 2020-01-01T08:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/424 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1423/type/native/viewcontent Faculty Research Work Animo Repository Natural language processing (Computer science) Microblogs Linguistic analysis (Linguistics) Computer Sciences South and Southeast Asian Languages and Societies |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
topic |
Natural language processing (Computer science) Microblogs Linguistic analysis (Linguistics) Computer Sciences South and Southeast Asian Languages and Societies |
spellingShingle |
Natural language processing (Computer science) Microblogs Linguistic analysis (Linguistics) Computer Sciences South and Southeast Asian Languages and Societies Ancheta, Jeffrey Rosario Gorro, Ken D. Uy, Mark Anthony D. #Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines |
description |
In this study, we aim to prove that natural language processing (NLP) can be used as a method for analyzing qualitative data like tweets. To validate this, we examine various topics emerging from tweets about class suspensions in the Philippines using #walangpasok (no classes). By utilizing three NLP techniques, we presented various topics. Using Latent Dirichlet Allocation (LDA), the study identified the following: (1) information dissemination and public announcements from government offices, and (2) sentiments of students. Through Word2Vec, we generated (1) monitoring and dissemination of alerts and warnings, (2) local government accountability, and (3) sentiments of Twitter users. An intrinsic evaluation was conducted for word2vec model using cosine similarity. The word-groups have an average cosine similarity of 0.997811. Lastly, the topics that emerged using K-means are (1) weather advisory, (2) class suspension announcements, (3) role of local government, and (4) users' sentiments and outlook towards a situation. The number of clusters generated by the k-means clustering algorithm was decided based on the silhouette score of 0.0153423865462. It turns out that the study provided the same results using NLP techniques. We also performed open coding to analyze the data manually and to ensure that the obtained results using the techniques are accurate. Thus, the use of NLP as a method of qualitative data analysis can be considered reliable and may be recommended to use in other qualitative research, particularly in the field of social sciences. © 2020 IEEE. |
format |
text |
author |
Ancheta, Jeffrey Rosario Gorro, Ken D. Uy, Mark Anthony D. |
author_facet |
Ancheta, Jeffrey Rosario Gorro, Ken D. Uy, Mark Anthony D. |
author_sort |
Ancheta, Jeffrey Rosario |
title |
#Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines |
title_short |
#Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines |
title_full |
#Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines |
title_fullStr |
#Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines |
title_full_unstemmed |
#Walangpasok on Twitter: Natural language processing as a method for analyzing tweets on class suspensions in the Philippines |
title_sort |
#walangpasok on twitter: natural language processing as a method for analyzing tweets on class suspensions in the philippines |
publisher |
Animo Repository |
publishDate |
2020 |
url |
https://animorepository.dlsu.edu.ph/faculty_research/424 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1423/type/native/viewcontent |
_version_ |
1718383377941266432 |