Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts

Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic...

Full description

Saved in:
Bibliographic Details
Main Authors: Aureus, Jelly P, Estuar, Ma. Regina Justina E, Mapua, Dorothy C, Abao, Roland P, Cataluña, Anna Angeline M
Format: text
Published: Archīum Ateneo 2021
Subjects:
Online Access:https://archium.ateneo.edu/discs-faculty-pubs/328
https://ieeexplore.ieee.org/document/9681367
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.discs-faculty-pubs-1329
record_format eprints
spelling ph-ateneo-arc.discs-faculty-pubs-13292022-05-02T06:12:25Z Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions. 2021-12-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic COVID-19
Support vector machines
Social networking (online)
Pandemics
Computational modeling
Machine learning
Linguistics
Cognitive Distortions
Mental Health
Social Mining
Machine Learning
Natural Language Processing
Text Classification
COVID-19
Cognitive Psychology
Computer Sciences
Databases and Information Systems
Mental and Social Health
spellingShingle COVID-19
Support vector machines
Social networking (online)
Pandemics
Computational modeling
Machine learning
Linguistics
Cognitive Distortions
Mental Health
Social Mining
Machine Learning
Natural Language Processing
Text Classification
COVID-19
Cognitive Psychology
Computer Sciences
Databases and Information Systems
Mental and Social Health
Aureus, Jelly P
Estuar, Ma. Regina Justina E
Mapua, Dorothy C
Abao, Roland P
Cataluña, Anna Angeline M
Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
description Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions.
format text
author Aureus, Jelly P
Estuar, Ma. Regina Justina E
Mapua, Dorothy C
Abao, Roland P
Cataluña, Anna Angeline M
author_facet Aureus, Jelly P
Estuar, Ma. Regina Justina E
Mapua, Dorothy C
Abao, Roland P
Cataluña, Anna Angeline M
author_sort Aureus, Jelly P
title Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_short Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_full Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_fullStr Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_full_unstemmed Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_sort determining linguistic markers in cognitive distortions from covid-19 pandemic-related reddit texts
publisher Archīum Ateneo
publishDate 2021
url https://archium.ateneo.edu/discs-faculty-pubs/328
https://ieeexplore.ieee.org/document/9681367
_version_ 1733052864328105984