Efficient white-box fairness testing through gradient search

Deep learning (DL) systems are increasingly deployed for autonomous decision-making in a wide range of applications. Apart from the robustness and safety, fairness is also an important property that a well-designed DL system should have. To evaluate and improve individual fairness of a model, system...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Lingfeng, ZHANG, Yueling, ZHANG, Min
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6966
https://ink.library.smu.edu.sg/context/sis_research/article/7969/viewcontent/EfficientWhiteBox_2021_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7969
record_format dspace
spelling sg-smu-ink.sis_research-79692022-03-04T05:52:47Z Efficient white-box fairness testing through gradient search ZHANG, Lingfeng ZHANG, Yueling ZHANG, Min Deep learning (DL) systems are increasingly deployed for autonomous decision-making in a wide range of applications. Apart from the robustness and safety, fairness is also an important property that a well-designed DL system should have. To evaluate and improve individual fairness of a model, systematic test case generation for identifying individual discriminatory instances in the input space is essential. In this paper, we propose a framework EIDIG for efficiently discovering individual fairness violation. Our technique combines a global generation phase for rapidly generating a set of diverse discriminatory seeds with a local generation phase for generating as many individual discriminatory instances as possible around these seeds under the guidance of the gradient of the model output. In each phase, prior information at successive iterations is fully exploited to accelerate convergence of iterative optimization or reduce frequency of gradient calculation. Our experimental results show that, on average, our approach EIDIG generates 19.11% more individual discriminatory instances with a speedup of 121.49% when compared with the state-of-the-art method and mitigates individual discrimination by 80.03% with a limited accuracy loss after retraining. 2021-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6966 info:doi/10.1145/3460319.3464820 https://ink.library.smu.edu.sg/context/sis_research/article/7969/viewcontent/EfficientWhiteBox_2021_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Fairness testing Neural networks Software bias Test case generation Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Fairness testing
Neural networks
Software bias
Test case generation
Software Engineering
spellingShingle Fairness testing
Neural networks
Software bias
Test case generation
Software Engineering
ZHANG, Lingfeng
ZHANG, Yueling
ZHANG, Min
Efficient white-box fairness testing through gradient search
description Deep learning (DL) systems are increasingly deployed for autonomous decision-making in a wide range of applications. Apart from the robustness and safety, fairness is also an important property that a well-designed DL system should have. To evaluate and improve individual fairness of a model, systematic test case generation for identifying individual discriminatory instances in the input space is essential. In this paper, we propose a framework EIDIG for efficiently discovering individual fairness violation. Our technique combines a global generation phase for rapidly generating a set of diverse discriminatory seeds with a local generation phase for generating as many individual discriminatory instances as possible around these seeds under the guidance of the gradient of the model output. In each phase, prior information at successive iterations is fully exploited to accelerate convergence of iterative optimization or reduce frequency of gradient calculation. Our experimental results show that, on average, our approach EIDIG generates 19.11% more individual discriminatory instances with a speedup of 121.49% when compared with the state-of-the-art method and mitigates individual discrimination by 80.03% with a limited accuracy loss after retraining.
format text
author ZHANG, Lingfeng
ZHANG, Yueling
ZHANG, Min
author_facet ZHANG, Lingfeng
ZHANG, Yueling
ZHANG, Min
author_sort ZHANG, Lingfeng
title Efficient white-box fairness testing through gradient search
title_short Efficient white-box fairness testing through gradient search
title_full Efficient white-box fairness testing through gradient search
title_fullStr Efficient white-box fairness testing through gradient search
title_full_unstemmed Efficient white-box fairness testing through gradient search
title_sort efficient white-box fairness testing through gradient search
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6966
https://ink.library.smu.edu.sg/context/sis_research/article/7969/viewcontent/EfficientWhiteBox_2021_av.pdf
_version_ 1770576149329477632