Classifying and extracting data from Facebook posts for online persona identification

Large amount of user-generated data areposted online in social media platforms, including user preferences, dining and leisureactivities, events, news and personal blogs.This resulted in varying efforts to process social media data using NLP and ML algorithmsfor topic classification, sentiment analy...

Full description

Saved in:
Bibliographic Details
Main Authors: Brosas, Hazel Anne, Lim, Eugene, Sevilla, Danica, Silva, Denise Anne, Ong, Ethel C.
Format: text
Published: Animo Repository 2018
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/4429
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Description
Summary:Large amount of user-generated data areposted online in social media platforms, including user preferences, dining and leisureactivities, events, news and personal blogs.This resulted in varying efforts to process social media data using NLP and ML algorithmsfor topic classification, sentiment analysis anddetection, and events classification. Such information are problematic to process, as theytend to be short, informal, inconsistent, andare highly contextualized. A series of tasks isinvolved from collecting, pre-processing, classification and extraction before social mediadata can be used. In this study, we built amulti-class classifier model to process Facebook posts in order to identify a user's onlinepersona based on his/her preferences. Information extraction is then applied to find relevant data from the classified posts that can beused to generate a description of the user's online persona. The classifier currently achievesan accuracy of 76.02% and an F1 score of73.10% using 10-fold cross validation from adataset containing 16,682 posts. © 2018 by the authors.