On negative results when using sentiment analysis tools for software engineering research

Recent years have seen an increasing attention to social aspects of software engineering, including studies of emotions and sentiments experienced and expressed by the software developers. Most of these studies reuse existing sentiment analysis tools such as SentiStrength and NLTK. However, these to...

Full description

Saved in:
Bibliographic Details
Main Authors: JONGELING, Robbert, SARKAR, Proshanta, DATTA, Subhajit, SEREBRENIK, Alexander
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5584
https://ink.library.smu.edu.sg/context/sis_research/article/6587/viewcontent/Jongeling2017_Article_OnNegativeResultsWhenUsingSent_pvoa.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6587
record_format dspace
spelling sg-smu-ink.sis_research-65872021-01-07T14:04:13Z On negative results when using sentiment analysis tools for software engineering research JONGELING, Robbert SARKAR, Proshanta DATTA, Subhajit SEREBRENIK, Alexander Recent years have seen an increasing attention to social aspects of software engineering, including studies of emotions and sentiments experienced and expressed by the software developers. Most of these studies reuse existing sentiment analysis tools such as SentiStrength and NLTK. However, these tools have been trained on product reviews and movie reviews and, therefore, their results might not be applicable in the software engineering domain. In this paper we study whether the sentiment analysis tools agree with the sentiment recognized by human evaluators (as reported in an earlier study) as well as with each other. Furthermore, we evaluate the impact of the choice of a sentiment analysis tool on software engineering studies by conducting a simple study of differences in issue resolution times for positive, negative and neutral texts. We repeat the study for seven datasets (issue trackers and Stack Overflow questions) and different sentiment analysis tools and observe that the disagreement between the tools can lead to diverging conclusions. Finally, we perform two replications of previously published studies and observe that the results of those studies cannot be confirmed when a different sentiment analysis tool is used. 2017-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5584 info:doi/10.1007/s10664-016-9493-x https://ink.library.smu.edu.sg/context/sis_research/article/6587/viewcontent/Jongeling2017_Article_OnNegativeResultsWhenUsingSent_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Negative results Replication study Sentiment analysis tools Numerical Analysis and Scientific Computing Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Negative results
Replication study
Sentiment analysis tools
Numerical Analysis and Scientific Computing
Software Engineering
spellingShingle Negative results
Replication study
Sentiment analysis tools
Numerical Analysis and Scientific Computing
Software Engineering
JONGELING, Robbert
SARKAR, Proshanta
DATTA, Subhajit
SEREBRENIK, Alexander
On negative results when using sentiment analysis tools for software engineering research
description Recent years have seen an increasing attention to social aspects of software engineering, including studies of emotions and sentiments experienced and expressed by the software developers. Most of these studies reuse existing sentiment analysis tools such as SentiStrength and NLTK. However, these tools have been trained on product reviews and movie reviews and, therefore, their results might not be applicable in the software engineering domain. In this paper we study whether the sentiment analysis tools agree with the sentiment recognized by human evaluators (as reported in an earlier study) as well as with each other. Furthermore, we evaluate the impact of the choice of a sentiment analysis tool on software engineering studies by conducting a simple study of differences in issue resolution times for positive, negative and neutral texts. We repeat the study for seven datasets (issue trackers and Stack Overflow questions) and different sentiment analysis tools and observe that the disagreement between the tools can lead to diverging conclusions. Finally, we perform two replications of previously published studies and observe that the results of those studies cannot be confirmed when a different sentiment analysis tool is used.
format text
author JONGELING, Robbert
SARKAR, Proshanta
DATTA, Subhajit
SEREBRENIK, Alexander
author_facet JONGELING, Robbert
SARKAR, Proshanta
DATTA, Subhajit
SEREBRENIK, Alexander
author_sort JONGELING, Robbert
title On negative results when using sentiment analysis tools for software engineering research
title_short On negative results when using sentiment analysis tools for software engineering research
title_full On negative results when using sentiment analysis tools for software engineering research
title_fullStr On negative results when using sentiment analysis tools for software engineering research
title_full_unstemmed On negative results when using sentiment analysis tools for software engineering research
title_sort on negative results when using sentiment analysis tools for software engineering research
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/5584
https://ink.library.smu.edu.sg/context/sis_research/article/6587/viewcontent/Jongeling2017_Article_OnNegativeResultsWhenUsingSent_pvoa.pdf
_version_ 1770575517592846336