Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2021
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.discs-faculty-pubs-1329 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.discs-faculty-pubs-13292022-05-02T06:12:25Z Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions. 2021-12-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health |
spellingShingle |
COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts |
description |
Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions. |
format |
text |
author |
Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M |
author_facet |
Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M |
author_sort |
Aureus, Jelly P |
title |
Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts |
title_short |
Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts |
title_full |
Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts |
title_fullStr |
Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts |
title_full_unstemmed |
Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts |
title_sort |
determining linguistic markers in cognitive distortions from covid-19 pandemic-related reddit texts |
publisher |
Archīum Ateneo |
publishDate |
2021 |
url |
https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367 |
_version_ |
1733052864328105984 |