BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems

Artificial intelligence systems, such as Sentiment Analysis (SA) systems, typically learn from large amounts of data that may reflect human bias. Consequently, such systems may exhibit unintended demographic bias against specific characteristics (e.g., gender, occupation, country-of-origin, etc.). S...

Full description

Saved in:

Bibliographic Details
Main Authors:	ASYROFI, Muhammad Hilmi, YANG, Zhou, IMAM NUR BANI YUSUF, KANG, Hong Jin, Ferdian, Thung, LO, David
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	sentiment analysis test case generation metamorphic testing bias fairness bug Artificial Intelligence and Robotics Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/7611 https://ink.library.smu.edu.sg/context/sis_research/article/8614/viewcontent/2102.01859.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8614
record_format	dspace
spelling	sg-smu-ink.sis_research-86142022-12-22T03:27:09Z BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems ASYROFI, Muhammad Hilmi YANG, Zhou IMAM NUR BANI YUSUF, KANG, Hong Jin Ferdian, Thung LO, David Artificial intelligence systems, such as Sentiment Analysis (SA) systems, typically learn from large amounts of data that may reflect human bias. Consequently, such systems may exhibit unintended demographic bias against specific characteristics (e.g., gender, occupation, country-of-origin, etc.). Such bias manifests in an SA system when it predicts different sentiments for similar texts that differ only in the characteristic of individuals described. To automatically uncover bias in SA systems, this paper presents BiasFinder, an approach that can discover biased predictions in SA systems via metamorphic testing. A key feature of BiasFinder is the automatic curation of suitable templates from any given text inputs, using various Natural Language Processing (NLP) techniques to identify words that describe demographic characteristics. Next, BiasFinder generates new texts from these templates by mutating words associated with a class of a characteristic (e.g., gender-specific words such as female names, “she”, “her”). These texts are then used to tease out bias in an SA system. BiasFinder identifies a bias-uncovering test case (BTC) when an SA system predicts different sentiments for texts that differ only in words associated with a different class (e.g., male vs. female) of a target characteristic (e.g., gender). We evaluate BiasFinder on 10 SA systems and 2 large scale datasets, and the results show that BiasFinder can create more BTCs than two popular baselines. We also conduct an annotation study and find that human annotators consistently think that test cases generated by BiasFinder are more fluent than the two baselines. 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7611 info:doi/10.1109/TSE.2021.3136169 https://ink.library.smu.edu.sg/context/sis_research/article/8614/viewcontent/2102.01859.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University sentiment analysis test case generation metamorphic testing bias fairness bug Artificial Intelligence and Robotics Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	sentiment analysis test case generation metamorphic testing bias fairness bug Artificial Intelligence and Robotics Software Engineering
spellingShingle	sentiment analysis test case generation metamorphic testing bias fairness bug Artificial Intelligence and Robotics Software Engineering ASYROFI, Muhammad Hilmi YANG, Zhou IMAM NUR BANI YUSUF, KANG, Hong Jin Ferdian, Thung LO, David BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems
description	Artificial intelligence systems, such as Sentiment Analysis (SA) systems, typically learn from large amounts of data that may reflect human bias. Consequently, such systems may exhibit unintended demographic bias against specific characteristics (e.g., gender, occupation, country-of-origin, etc.). Such bias manifests in an SA system when it predicts different sentiments for similar texts that differ only in the characteristic of individuals described. To automatically uncover bias in SA systems, this paper presents BiasFinder, an approach that can discover biased predictions in SA systems via metamorphic testing. A key feature of BiasFinder is the automatic curation of suitable templates from any given text inputs, using various Natural Language Processing (NLP) techniques to identify words that describe demographic characteristics. Next, BiasFinder generates new texts from these templates by mutating words associated with a class of a characteristic (e.g., gender-specific words such as female names, “she”, “her”). These texts are then used to tease out bias in an SA system. BiasFinder identifies a bias-uncovering test case (BTC) when an SA system predicts different sentiments for texts that differ only in words associated with a different class (e.g., male vs. female) of a target characteristic (e.g., gender). We evaluate BiasFinder on 10 SA systems and 2 large scale datasets, and the results show that BiasFinder can create more BTCs than two popular baselines. We also conduct an annotation study and find that human annotators consistently think that test cases generated by BiasFinder are more fluent than the two baselines.
format	text
author	ASYROFI, Muhammad Hilmi YANG, Zhou IMAM NUR BANI YUSUF, KANG, Hong Jin Ferdian, Thung LO, David
author_facet	ASYROFI, Muhammad Hilmi YANG, Zhou IMAM NUR BANI YUSUF, KANG, Hong Jin Ferdian, Thung LO, David
author_sort	ASYROFI, Muhammad Hilmi
title	BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems
title_short	BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems
title_full	BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems
title_fullStr	BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems
title_full_unstemmed	BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems
title_sort	biasfinder: metamorphic test generation to uncover bias for sentiment analysis systems
publisher	Institutional Knowledge at Singapore Management University
publishDate	2022
url	https://ink.library.smu.edu.sg/sis_research/7611 https://ink.library.smu.edu.sg/context/sis_research/article/8614/viewcontent/2102.01859.pdf
_version_	1770576394097524736

BiasFinder: Metamorphic test generation to uncover bias for sentiment analysis systems

Similar Items