Fair Algorithms for Clustering

Open AccessProceedings Article

Fair Algorithms for Clustering

- 08 Jan 2019

Vol. 32, pp 4954-4965

181

TL;DR: This work significantly generalizes the seminal work of Chierichetti this http URL and transforms any vanilla clustering solution into a fair one incurring only a slight loss in quality.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Fairness in Machine Learning: A Survey.

Simon Caton, +1 more

- 04 Oct 2020

- arXiv: Learning

TL;DR: An overview of the different schools of thought and approaches to mitigating (social) biases and increase fairness in the Machine Learning literature is provided, organises approaches into the widely accepted framework of pre-processing, in- processing, and post-processing methods, subcategorizing into a further 11 method areas.

...read moreread less

495

•Posted Content

Fair Generative Modeling via Weak Supervision

Kristy Choi, +4 more

- 26 Oct 2019

- arXiv: Learning

TL;DR: In this article, a weakly supervised algorithm for overcoming dataset bias for deep generative models is presented, which requires access to an additional small, unlabeled reference dataset as the supervision signal, thus sidestepping the need for explicit labels on the underlying bias factors.

...read moreread less

104

•Proceedings Article•10.1145/3292500.3330987

Clustering without Over-Representation

Sara Ahmadian, +3 more

- 25 Jul 2019

TL;DR: This paper obtains an algorithm that has provable guarantees of performance and a simpler combinatorial algorithm for the special case of the problem where no color has an absolute majority in any cluster.

...read moreread less

99

•Journal Article•10.1007/s10618-022-00854-z

Algorithmic fairness datasets: the story so far

Alessandro Fabris, +3 more

- 03 Feb 2022

- Data Mining and Knowledge Discovery

TL;DR: In this article , the authors focus on data documentation debt by surveying over two hundred datasets employed in algorithmic fairness research, and producing standardized and searchable documentation for each of them.

...read moreread less

91

•Posted Content

Coresets for Clustering with Fairness Constraints

Lingxiao Huang, +2 more

- 20 Jun 2019

- arXiv: Data Structures and Algorithms

TL;DR: An approach to clustering with fairness constraints that involve multiple, non-disjoint types, that is also scalable and achieves a speed-up to recent fair clustering algorithms by incorporating the first known coreset construction for theFair clustering problem with thek-median objective.

...read moreread less

69

...

Expand

References

•Proceedings Article•10.1145/2783258.2783311

Certifying and Removing Disparate Impact

Michael Feldman, +4 more

- 10 Aug 2015

TL;DR: This work links disparate impact to a measure of classification accuracy that while known, has received relatively little attention and proposes a test for disparate impact based on how well the protected class can be predicted from the other attributes.

...read moreread less

2.1K

Journal Article•10.1287/MOOR.10.2.180

A Best Possible Heuristic for the k-Center Problem

Dorit S. Hochbaum, +1 more

- 01 May 1985

- Mathematics of Operations Research

TL;DR: A 2-approximation algorithm for the k-center problem with triangle inequality is presented, the key combinatorial object used is called a strong stable set, and the NP-completeness of the corresponding decision problem is proved.

...read moreread less

1.1K

•Journal Article•10.1126/SCIADV.AAO5580

The accuracy, fairness, and limits of predicting recidivism.

Julia Dressel, +1 more

- 01 Jan 2018

- Science Advances

TL;DR: It is shown that the widely used commercial risk assessment software COMPAS is no more accurate or fair than predictions made by people with little or no criminal justice expertise.

...read moreread less

1K

Journal Article•10.1016/J.ESWA.2007.12.020

The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients

I-Cheng Yeh, +1 more

- 01 Mar 2009

- Expert Systems With Applications

TL;DR: Among the six data mining techniques, artificial neural network is the only one that can accurately estimate the real probability of default, and its regression intercept is close to zero, and regression coefficient to one.

...read moreread less

1K

•Book Chapter•10.1007/978-3-642-33486-3_3

Fairness-aware classifier with prejudice remover regularizer

Toshihiro Kamishima, +3 more

- 24 Sep 2012

TL;DR: A regularization approach is proposed that is applicable to any prediction algorithm with probabilistic discriminative models and applied to logistic regression and empirically show its effectiveness and efficiency.

...read moreread less

978

...

Expand

Fair Algorithms for Clustering

Chat with Paper

AI Agents for this Paper

Citations

Fairness in Machine Learning: A Survey.

Fair Generative Modeling via Weak Supervision

Clustering without Over-Representation

Algorithmic fairness datasets: the story so far

Coresets for Clustering with Fairness Constraints

References

Certifying and Removing Disparate Impact

A Best Possible Heuristic for the k-Center Problem

The accuracy, fairness, and limits of predicting recidivism.

The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients

Fairness-aware classifier with prejudice remover regularizer

Related Papers (5)

Fair Clustering Through Fairlets

Fairness through awareness

Scalable Fair Clustering

Clustering without Over-Representation

Certifying and Removing Disparate Impact