Proceedings Article10.1145/3303772.3303791
Evaluating the Fairness of Predictive Student Models Through Slicing Analysis
Josh Gardner,Christopher Brooks,Ryan S. Baker +2 more
- 04 Mar 2019
- pp 225-234
175
TL;DR: This work provides a framework for quantifying and understanding how predictive models might inadvertently privilege, or disparately impact, different student subgroups and suggests that learning analytics researchers and practitioners can use slicing analysis to improve model fairness without necessarily sacrificing performance.
read more
Abstract: Predictive modeling has been a core area of learning analytics research over the past decade, with such models currently deployed in a variety of educational contexts from MOOCs to K-12. However, analyses of the differential effectiveness of these models across demographic, identity, or other groups has been scarce. In this paper, we present a method for evaluating unfairness in predictive student models. We define this in terms of differential accuracy between subgroups, and measure it using a new metric we term the Absolute Between-ROC Area (ABROCA). We demonstrate the proposed method through a gender-based "slicing analysis" using five different models replicated from other works and a dataset of 44 unique MOOCs and over four million learners. Our results demonstrate (1) significant differences in model fairness according to (a) statistical algorithm and (b) feature set used; (2) that the gender imbalance ratio, curricular area, and specific course used for a model all display significant association with the value of the ABROCA statistic; and (3) that there is not evidence of a strict tradeoff between performance and fairness. This work provides a framework for quantifying and understanding how predictive models might inadvertently privilege, or disparately impact, different student subgroups. Furthermore, our results suggest that learning analytics researchers and practitioners can use slicing analysis to improve model fairness without necessarily sacrificing performance.1
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Proceedings Article
Towards Accurate and Fair Prediction of College Success: Evaluating Different Sources of Student Data.
Renzhe Yu,Qiujie Li,Christian Fischer,Shayan Doroudi,Di Xu +4 more
- 01 Jul 2020
TL;DR: This work examined the utility of institutional data, learning management system (LMS) data, and survey data for accurately and fairly predicting short-term and long-term student success and found that institutional data and LMS data both have decent predictive power, but survey data shows very little predictive utility.
Survey on Explainable AI: From Approaches, Limitations and Applications Aspects
Wenli Yang,Yuchen Wei,Hanyu Wei,Yanyu Chen,Guan-Ru Huang,Xiang Li,Renjie Li,Naimeng Yao,Xinyi Wang,Xiaodong Gu,Muhammad Bilal Amin,Byeong Kang +11 more
TL;DR: A comprehensive overview of recent research on XAI approaches from three well-defined taxonomies is presented and an in-depth analysis and summary of the status and prospects of XAI applications in several key areas where reliable explanations are urgently needed to avoid mistakes in decision-making is offered.
71
Towards strengthening links between learning analytics and assessment: Challenges and potentials of a promising new bond
TL;DR: In this article , the authors highlight pressing scientific and practical challenges and opportunities in the connections between learning analytics and assessment that will require interdisciplinary teams to address: task design, analysis of learning progressions, trustworthiness, and fairness.
69
Personalized Education in the Artificial Intelligence Era: What to Expect Next
TL;DR: In this paper, the authors investigate the challenges of AI/ML-based personalized education and discuss potential solutions, including compensating for the adverse effects of the absence of peers, creating and maintaining motivations for learning, increasing diversity, removing the biases induced by data and algorithms, and so on.
64
Incorporating AI and learning analytics to build trustworthy peer assessment systems
TL;DR: In this paper , the authors present and evaluate their AI-assisted and analytical approaches that aim to offer guidelines and assistance to student-assessors during individual reviews to provide better feedback, integrate probabilistic and text analysis inference models to improve the accuracy of the assigned grades, develop feedback on reviews strategies that enable peer assessors to review the work of each other, and employ a spot-checking mechanism to assist instructors in optimally overseeing the peer assessment process.
62
References
The meaning and use of the area under a receiver operating characteristic (ROC) curve.
TL;DR: A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented and it is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a random chosen non-diseased subject.
21.8K
•Proceedings Article
Equality of opportunity in supervised learning
Moritz Hardt,Eric Price,Nathan Srebro +2 more
- 05 Dec 2016
TL;DR: This work proposes a criterion for discrimination against a specified sensitive attribute in supervised learning, where the goal is to predict some target based on available features and shows how to optimally adjust any learned predictor so as to remove discrimination according to this definition.
Fairness through awareness
Cynthia Dwork,Moritz Hardt,Toniann Pitassi,Omer Reingold,Richard S. Zemel +4 more
- 08 Jan 2012
TL;DR: A framework for fair classification comprising a (hypothetical) task-specific metric for determining the degree to which individuals are similar with respect to the classification task at hand and an algorithm for maximizing utility subject to the fairness constraint, that similar individuals are treated similarly is presented.
Big Data's Disparate Impact
Solon Barocas,Andrew D. Selbst +1 more
TL;DR: In the absence of a demonstrable intent to discriminate, the best doctrinal hope for data mining's victims would seem to lie in disparate impact doctrine as discussed by the authors, which holds that a practice can be justified as a business necessity when its outcomes are predictive of future employment outcomes, and data mining is specifically designed to find such statistical correlations.
2.8K
•Posted Content
Fairness Through Awareness
TL;DR: In this article, the authors proposed a framework for fair classification comprising a task-specific metric for determining the degree to which individuals are similar with respect to the classification task at hand, and an algorithm for maximizing utility subject to the fairness constraint that similar individuals are treated similarly.
2.8K
Related Papers (5)
Cynthia Dwork,Moritz Hardt,Toniann Pitassi,Omer Reingold,Richard S. Zemel +4 more
- 08 Jan 2012
Moritz Hardt,Eric Price,Nathan Srebro +2 more
- 05 Dec 2016