Abstract
Many modern machine learning algorithms mitigate bias by enforcing fairness constraints across coarsely-defined groups related to a sensitive attribute like gender or race. However, these algorithms seldom account for within-group heterogeneity and biases that may disproportionately affect some members of a group. In this work, we characterize Social Norm Bias (SNoB), a subtle but consequential type of algorithmic discrimination that may be exhibited by machine learning models, even when these systems achieve group fairness objectives. We study this issue through the lens of gender bias in occupation classification. We quantify SNoB by measuring how an algorithm’s predictions are associated with conformity to inferred gender norms. When predicting if an individual belongs to a male-dominated occupation, this framework reveals that “fair” classifiers still favor biographies written in ways that align with inferred masculine norms. We compare SNoB across algorithmic fairness techniques and show that it is frequently a residual bias, and post-processing approaches do not mitigate this type of bias at all.
| Original language | English |
|---|---|
| Pages (from-to) | 1858-1884 |
| Number of pages | 27 |
| Journal | Data Mining and Knowledge Discovery |
| Volume | 37 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - Sept 2023 |
| Externally published | Yes |
Keywords
- Algorithmic fairness
- Automated hiring
- Gender bias
- Group fairness
- Occupation classification
- Online recruiting
- Social norms
Fingerprint
Dive into the research topics of 'Social norm bias: residual harms of fairness-aware algorithms'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver