Fairness Testing: Testing Software for Discrimination

doi:10.1145/3106237.3106277

Open AccessProceedings Article10.1145/3106237.3106277

Fairness Testing: Testing Software for Discrimination

Sainyam Galhotra, +2 more

- 11 Sep 2017

- arXiv: Software Engineering

31

TL;DR: It is demonstrated that fairness testing is a critical aspect of the software development cycle in domains with possible discrimination and initial tools for measuring software discrimination are provided.

Abstract: This paper defines software fairness and discrimination and develops a testing-based method for measuring if and how much software discriminates, focusing on causality in discriminatory behavior. Evidence of software discrimination has been found in modern software systems that recommend criminal sentences, grant access to financial products, and determine who is allowed to participate in promotions. Our approach, Themis, generates efficient test suites to measure discrimination. Given a schema describing valid system inputs, Themis generates discrimination tests automatically and does not require an oracle. We evaluate Themis on 20 software systems, 12 of which come from prior work with explicit focus on avoiding discrimination. We find that (1) Themis is effective at discovering software discrimination, (2) state-of-the-art techniques for removing discrimination from algorithms fail in many situations, at times discriminating against as much as 98% of an input subdomain, (3) Themis optimizations are effective at producing efficient test suites for measuring discrimination, and (4) Themis is more efficient on systems that exhibit more discrimination. We thus demonstrate that fairness testing is a critical aspect of the software development cycle in domains with possible discrimination and provide initial tools for measuring software discrimination.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/3494672

A Review on Fairness in Machine Learning

Dana Pessach, +1 more

- 03 Feb 2022

- ACM Computing Surveys

TL;DR: An overview of the main concepts of identifying, measuring, and improving algorithmic fairness when using ML algorithms, focusing primarily on classification tasks is presented.

...read moreread less

430

•Proceedings Article•10.1145/3540250.3549093

MAAT: a novel ensemble approach to addressing fairness and performance bugs for machine learning software

Zhenpeng Chen, +3 more

- 07 Nov 2022

TL;DR: In this paper , the authors proposed a novel ensemble approach to improve fairness-performance trade-off for ML software, which combines models optimized for different objectives: fairness and ML performance.

...read moreread less

54

•Journal Article•10.1145/3583561

A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers

Zhenpeng Chen, +3 more

- 07 Jul 2022

- ACM Transactions on Software Engineering...

TL;DR: In this article , the authors present a large-scale comprehensive empirical study of 17 representative bias mitigation methods for Machine Learning (ML) classifiers, evaluated with 11 ML performance metrics (e.g., accuracy), 4 fairness metrics, and 20 types of fairness-performance tradeoff assessment, applied to 8 widely-adopted software decision tasks.

...read moreread less

51

•Proceedings Article•10.1145/3510003.3510202

Fairness-aware Configuration of Machine Learning Libraries

Saeid Tizpaz-Niari, +3 more

- 13 Feb 2022

TL;DR: This paper designs three search-based software testing algorithms to un-cover the precision-fairness frontier of the hyperparameter space and implements the proposed approaches in the tool Parfait-ML, which shows its effectiveness and utility over five mature ML algorithms as used in six social-critical applications.

...read moreread less

47

•Journal Article•10.1145/3514258

Toward Involving End-users in Interactive Human-in-the-loop AI Fairness

Yuri Nakao, +4 more

- 22 Apr 2022

- ACM transactions on interactive intellig...

TL;DR: This work co-designed and implemented a prototype system that allowed end-users to see why predictions were made, and then to change weights on features to “debug” fairness issues, and evaluated the use of this prototype system through an online study.

...read moreread less

42

...

Expand

References

•Proceedings Article•10.1145/2090236.2090255

Fairness through awareness

Cynthia Dwork, +4 more

- 08 Jan 2012

TL;DR: A framework for fair classification comprising a (hypothetical) task-specific metric for determining the degree to which individuals are similar with respect to the classification task at hand and an algorithm for maximizing utility subject to the fairness constraint, that similar individuals are treated similarly is presented.

...read moreread less

3.2K

•Journal Article•10.1214/09-SS057

Causal inference in statistics: An overview

Judea Pearl

- 15 Jul 2009

- Statistics Surveys

TL;DR: A review of recent advances in causal inference can be found in this article, where a general theory of causation based on the Structural Causal Model (SCM) described in Pearl (2000a) is presented.

...read moreread less

2.4K

•Proceedings Article

Counterfactual fairness

Matt J. Kusner, +3 more

- 04 Dec 2017

TL;DR: This paper develops a framework for modeling fairness using tools from causal inference and demonstrates the framework on a real-world problem of fair prediction of success in law school.

...read moreread less

1.5K

•Journal Article•10.1109/TSE.2014.2372785

The Oracle Problem in Software Testing: A Survey

Earl T. Barr, +4 more

- 01 May 2015

- IEEE Transactions on Software Engineerin...

TL;DR: This paper provides a comprehensive survey of current approaches to the test oracle problem and an analysis of trends in this important area of software testing research and practice.

...read moreread less

1K

...

Expand

Fairness Testing: Testing Software for Discrimination

Chat with Paper

AI Agents for this Paper

Citations

A Review on Fairness in Machine Learning

MAAT: a novel ensemble approach to addressing fairness and performance bugs for machine learning software

A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers

Fairness-aware Configuration of Machine Learning Libraries

Toward Involving End-users in Interactive Human-in-the-loop AI Fairness

References

Scikit-learn: Machine Learning in Python

Fairness through awareness

Causal inference in statistics: An overview

Counterfactual fairness

The Oracle Problem in Software Testing: A Survey

Related Papers (5)

Automatic Categorization of Software Modules

Assessment of software testing and quality assurance in natural language processing applications and a linguistically inspired approach to improving it

A practical guide to testing object-oriented software

Wikifying software artifacts

Total Recall, Language Processing, and Software Engineering