Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts

Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic...

Full description

Saved in:

Bibliographic Details
Main Authors:	Aureus, Jelly P, Estuar, Ma. Regina Justina E, Mapua, Dorothy C, Abao, Roland P, Cataluña, Anna Angeline M
Format:	text
Published:	Archīum Ateneo 2021
Subjects:	COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health
Online Access:	https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Ateneo De Manila University

id	ph-ateneo-arc.discs-faculty-pubs-1329
record_format	eprints
spelling	ph-ateneo-arc.discs-faculty-pubs-13292022-05-02T06:12:25Z Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions. 2021-12-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health
institution	Ateneo De Manila University
building	Ateneo De Manila University Library
continent	Asia
country	Philippines Philippines
content_provider	Ateneo De Manila University Library
collection	archium.Ateneo Institutional Repository
topic	COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health
spellingShingle	COVID-19 Support vector machines Social networking (online) Pandemics Computational modeling Machine learning Linguistics Cognitive Distortions Mental Health Social Mining Machine Learning Natural Language Processing Text Classification COVID-19 Cognitive Psychology Computer Sciences Databases and Information Systems Mental and Social Health Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
description	Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions.
format	text
author	Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M
author_facet	Aureus, Jelly P Estuar, Ma. Regina Justina E Mapua, Dorothy C Abao, Roland P Cataluña, Anna Angeline M
author_sort	Aureus, Jelly P
title	Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_short	Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_full	Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_fullStr	Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_full_unstemmed	Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts
title_sort	determining linguistic markers in cognitive distortions from covid-19 pandemic-related reddit texts
publisher	Archīum Ateneo
publishDate	2021
url	https://archium.ateneo.edu/discs-faculty-pubs/328 https://ieeexplore.ieee.org/document/9681367
_version_	1733052864328105984

Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts

Similar Items