Concepedia

Publication | Open Access

Counterfactual Fairness

Citations: 867 | References: 26 | Year: 2017

TLDR

Machine learning used in high‑stakes decisions can perpetuate past biases, so models must account for historical unfairness to avoid discriminatory outcomes. The authors propose a causal‑inference framework to model fairness. Counterfactual fairness requires that a prediction remain unchanged when the individual's demographic group is altered in a counterfactual scenario. Applying the framework to law‑school admission data, the authors show it can produce fair predictions of success.

Abstract

Machine learning can impact people with legal or ethical consequences when it is used to automate decisions in areas such as insurance, lending, hiring, and predictive policing. In many of these scenarios, previous decisions have been made that are unfairly biased against certain subpopulations, for example those of a particular race, gender, or sexual orientation. Since this past data may be biased, machine learning predictors must account for this to avoid perpetuating or creating discriminatory practices. In this paper, we develop a framework for modeling fairness using tools from causal inference. Our definition of counterfactual fairness captures the intuition that a decision is fair towards an individual if it is the same in (a) the actual world and (b) a counterfactual world where the individual belonged to a different demographic group. We demonstrate our framework on a real-world problem of fair prediction of success in law school.
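The counterfactual comparison described in the abstract can be illustrated with a toy structural causal model. The sketch below is hypothetical and only loosely inspired by the law-school setting: a latent ability `u` and a protected attribute `a` jointly generate observed GPA and LSAT scores, and the coefficients are made-up illustrative values, not taken from the paper. A predictor built on descendants of the protected attribute changes under the counterfactual intervention on `a`, while one built only on the latent (non-descendant) variable does not.

```python
# Hypothetical linear structural causal model (illustrative coefficients).
# Latent ability u; protected attribute a in {0, 1}; observed GPA and LSAT.
def sample(a, u):
    gpa = 0.5 * u + 0.3 * a    # a influences the observed score (historical bias)
    lsat = 0.7 * u + 0.2 * a
    return gpa, lsat

def biased_predict(gpa, lsat):
    # Uses descendants of a, so its output shifts when a is counterfactually changed.
    return 0.6 * gpa + 0.4 * lsat

def fair_predict(u):
    # Uses only the latent variable, a non-descendant of a, so the
    # prediction is identical in the factual and counterfactual worlds.
    return u

u = 1.2
gpa0, lsat0 = sample(0, u)  # factual world: a = 0
gpa1, lsat1 = sample(1, u)  # counterfactual world: a = 1, same latent u

print(biased_predict(gpa0, lsat0) != biased_predict(gpa1, lsat1))  # True: not fair
print(fair_predict(u) == fair_predict(u))                          # True: fair
```

This mirrors the intuition in the abstract: fairness is judged by holding the individual's latent background fixed and intervening only on the demographic attribute.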
