Modeling personality traits of Filipino Twitter users

Recent studies in the field of text-based personality recognition experiment with different languages, feature extraction techniques, and machine learning algorithms to create better and more accurate models; however, little focus is placed on exploring the language use of a group of individuals def...

Bibliographic Details
Main Authors: Tighe, Edward P., Cheng, Charibeth K.
Format: text
Published: Animo Repository 2018
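Subjects: Natural language processing (Computer science); Information filtering systems; Machine learning; Personality; Online social networks; Computer Sciences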
Online Access: https://animorepository.dlsu.edu.ph/faculty_research/13396
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-15150
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-15150 2024-11-11T08:04:33Z
2018-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/13396 Faculty Research Work Animo Repository
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Natural language processing (Computer science)
Information filtering systems
Machine learning
Personality
Online social networks
Computer Sciences
description Recent studies in text-based personality recognition experiment with different languages, feature extraction techniques, and machine learning algorithms to create more accurate models; however, little focus is placed on exploring the language use of a group of individuals defined by nationality. Individuals of the same nationality share certain practices and communicate certain ideas that can become embedded in their natural language. Many nationals also speak more than one language; Filipinos, for example, speak Filipino and English, the two national languages of the Philippines. The addition of several regional and indigenous languages, along with the prevalence of code-switching, gives Filipinos a rich vocabulary. This presents an opportunity to create a text-based personality model based on how Filipinos speak, regardless of the language they use. To do so, data were collected from 250 Filipino Twitter users. Different combinations of data processing techniques were tested to create personality models for each of the Big Five traits. The results for both regression and classification show that Conscientiousness is consistently the easiest trait to model, followed by Extraversion. Classification models for Agreeableness and Neuroticism had subpar performance but performed better than those for Openness. An analysis of personality trait score representation showed that classifying extreme outliers generally produces better results for all traits except Neuroticism and Openness.
format text
author Tighe, Edward P.
Cheng, Charibeth K.
author_sort Tighe, Edward P.
title Modeling personality traits of Filipino Twitter users
title_sort modeling personality traits of filipino twitter users
publisher Animo Repository
publishDate 2018
url https://animorepository.dlsu.edu.ph/faculty_research/13396
_version_ 1816861348079861760