A multimodal approach to automatic personality recognition on Filipino social media data
There have been studies that found that linguistic style is an independent and meaningful way of exploring personality. These studies have encouraged more research on text-based personality trait recognition. With the growing trend of social media, social networking sites such as Twitter and Instagr...
Saved in:
Main Author: | |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2021
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etdm_comsci/13 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1013&context=etdm_comsci |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etdm_comsci-1013 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etdm_comsci-10132022-01-07T08:13:11Z A multimodal approach to automatic personality recognition on Filipino social media data Secuya, Alfonso C. There have been studies that found that linguistic style is an independent and meaningful way of exploring personality. These studies have encouraged more research on text-based personality trait recognition. With the growing trend of social media, social networking sites such as Twitter and Instagram have gained a large amount of users, making them an easy source for data. More recent studies have looked into using different types of learning features from the sites such as texts, images, and more. Existing studies have experimented on using these features together, however, most of these studies do not experiment on low-resource language data and which features are important for the language. This research addressed this concern by making use of a filtered dataset consisting of 2231 Filipino Twitter users and experimenting on different types of features used by themselves or together. Different types of machine learning models were made for each personality trait of the Big Five with seven (7) different feature sets. Findings show that TF-IDF are the most useful features. Other text features and account features were not able to show that any learning occurred by themselves but using them together shows a small improvement in performance. Account features also show some learning with the Neuroticism personality trait. Analysis of the learning algorithms show that Support Vector Regressor (SVR) was able to perform the best but it was only able to show significant results with TF-IDF. Usage of the Radial Basis Function kernel with the SVR might have contributed to the performance. Keywords: Natural Language Processing, Machine Learning, Psychology, Personality Traits 2021-09-19T07:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdm_comsci/13 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1013&context=etdm_comsci Computer Science Master's Theses English Animo Repository Natural language processing (Computer science) Machine learning Personality Computer Sciences |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Natural language processing (Computer science) Machine learning Personality Computer Sciences |
spellingShingle |
Natural language processing (Computer science) Machine learning Personality Computer Sciences Secuya, Alfonso C. A multimodal approach to automatic personality recognition on Filipino social media data |
description |
There have been studies that found that linguistic style is an independent and meaningful way of exploring personality. These studies have encouraged more research on text-based personality trait recognition. With the growing trend of social media, social networking sites such as Twitter and Instagram have gained a large amount of users, making them an easy source for data. More recent studies have looked into using different types of learning features from the sites such as texts, images, and more. Existing studies have experimented on using these features together, however, most of these studies do not experiment on low-resource language data and which features are important for the language. This research addressed this concern by making use of a filtered dataset consisting of 2231 Filipino Twitter users and experimenting on different types of features used by themselves or together. Different types of machine learning models were made for each personality trait of the Big Five with seven (7) different feature sets. Findings show that TF-IDF are the most useful features. Other text features and account features were not able to show that any learning occurred by themselves but using them together shows a small improvement in performance. Account features also show some learning with the Neuroticism personality trait. Analysis of the learning algorithms show that Support Vector Regressor (SVR) was able to perform the best but it was only able to show significant results with TF-IDF. Usage of the Radial Basis Function kernel with the SVR might have contributed to the performance.
Keywords: Natural Language Processing, Machine Learning, Psychology, Personality Traits |
format |
text |
author |
Secuya, Alfonso C. |
author_facet |
Secuya, Alfonso C. |
author_sort |
Secuya, Alfonso C. |
title |
A multimodal approach to automatic personality recognition on Filipino social media data |
title_short |
A multimodal approach to automatic personality recognition on Filipino social media data |
title_full |
A multimodal approach to automatic personality recognition on Filipino social media data |
title_fullStr |
A multimodal approach to automatic personality recognition on Filipino social media data |
title_full_unstemmed |
A multimodal approach to automatic personality recognition on Filipino social media data |
title_sort |
multimodal approach to automatic personality recognition on filipino social media data |
publisher |
Animo Repository |
publishDate |
2021 |
url |
https://animorepository.dlsu.edu.ph/etdm_comsci/13 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1013&context=etdm_comsci |
_version_ |
1722366416391241728 |