This page is part of the Responsible AI series:
- Introduction
- List of bias
- Bias Metrics
Introduction
Bias and fairness are critical considerations when it comes to artificial intelligence (AI) systems. In the context of AI, bias refers to the systematic errors or deviations from an accurate or fair outcome that can occur in the development and deployment of AI algorithms.
Bias
Bias can manifest in different forms, with statistical bias and ethics bias being two distinct aspects:
- Statistical bias, often referred to as algorithmic bias, is a type of bias that arises from the data used to train machine learning models. If the training data contains inherent biases or reflects historical inequalities, the resulting AI systems can perpetuate and amplify those biases. For example, if a hiring algorithm is trained on biased historical hiring data, it may inadvertently discriminate against certain demographic groups when making hiring recommendations.
- Ethics bias refers to the biases that arise from the ethical judgments and values embedded in the design, development, and decision-making processes of AI systems. This type of bias can emerge when the objectives or criteria used to optimize AI algorithms reflect certain societal or individual biases. For instance, if an AI system is designed to maximize profit without considering potential social or environmental impacts, it may prioritize short-term financial gains over long-term societal well-being.
Recognizing and addressing both statistical bias and ethics bias is essential for ensuring fairness in AI systems. Mitigating statistical bias requires careful data preprocessing, ensuring representative and diverse training datasets, and employing techniques such as bias correction and fairness-aware learning. Ethics bias, on the other hand, calls for critical reflection, ethical frameworks, and stakeholder engagement throughout the AI development lifecycle to identify and challenge the underlying assumptions and values shaping the system's behavior.
Achieving fairness in AI is a multifaceted endeavor that requires a holistic approach. It involves a combination of technical solutions, regulatory measures, and ethical considerations to promote transparency, accountability, and inclusivity. By addressing statistical bias and ethics bias, we can strive towards AI systems that treat individuals equitably, avoid discrimination, and contribute to a more just and unbiased society.
List of Bias
- An “Unbiased” Guide to Bias in AI: differences between statistical bias and ethics bias; ethical bias by proxy; ethical bias through historically biased decisions; ethical bias through an inappropriate choice of target.
In more detail:

- Cognitive Bias
- Inductive Bias
- Technical Bias
- Omitted Variable Bias
- Bias-variance tradeoff
Bias metrics
Below is a list of metrics used to detect bias and to constrain a system to be "fair" according to these metrics.
Equal performance refers to the assurance that a model is equally accurate for individuals in the protected and non-protected groups. Equal performance has three commonly discussed forms:
- equal sensitivity (also known as equal opportunity),
- equal sensitivity and specificity, i.e. equal true positive rate and equal false positive rate (also known as equalized odds),
- equal positive predictive value (commonly referred to as predictive parity).
Equalized Odds
P(R = + | Y = y, A = a) = P(R = + | Y = y, A = b)  for all y ∈ {+, −} and all groups a, b,

where A is the sensitive feature, Y is the true label, and R is the binary prediction {+, −}. Equivalently, the true positive rate and the false positive rate are equal across groups. The concept was originally defined for binary-valued Y, but in 2017, Woodworth et al. generalized it to multiple classes.
Use Cases
- Criminal justice systems to ensure equal false positive and true positive rates across different races.
- Medical diagnosis systems to ensure equal error rates across genders.
Limitations
- Difficult to achieve in practice, as it requires balancing multiple rates
- May conflict with other fairness criteria and overall accuracy
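As a concrete illustration, equalized odds can be checked by comparing the true positive rate and the false positive rate per group. Below is a minimal sketch in plain NumPy, assuming binary labels and predictions encoded as 0/1; all variable names and data are illustrative:

```python
import numpy as np

def rates_by_group(y_true, y_pred, groups):
    """Return (TPR, FPR) per group; equalized odds holds when both match across groups."""
    out = {}
    for g in np.unique(groups):
        m = groups == g
        yt, yp = y_true[m], y_pred[m]
        tpr = np.mean(yp[yt == 1]) if np.any(yt == 1) else np.nan  # P(R=+ | Y=+, A=g)
        fpr = np.mean(yp[yt == 0]) if np.any(yt == 0) else np.nan  # P(R=+ | Y=-, A=g)
        out[g] = (tpr, fpr)
    return out

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 0, 1, 1, 0, 1, 0])
groups = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])
print(rates_by_group(y_true, y_pred, groups))
```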
Predictive Parity
P(Y = + | R = +, A = a) = P(Y = + | R = +, A = b)  for all groups a, b,

where A is the sensitive feature, Y is the true label, and R is the binary prediction {+, −}. Predictive parity ensures that positive predictions have the same precision across different groups.
Use Cases
- Loan default predictions to ensure equal precision across different demographic groups
- Healthcare treatment recommendations to ensure equal accuracy across different patient groups
Limitations
- May not address underlying disparities in data distribution
- Can conflict with other fairness metrics like equalized odds
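Analogously, predictive parity can be checked by comparing precision (positive predictive value) per group. A minimal NumPy sketch, with illustrative names:

```python
import numpy as np

def precision_by_group(y_true, y_pred, groups):
    """Precision P(Y=+ | R=+, A=g) per group; predictive parity holds when these match."""
    out = {}
    for g in np.unique(groups):
        m = (groups == g) & (y_pred == 1)            # positive predictions in group g
        out[g] = np.mean(y_true[m]) if np.any(m) else np.nan
    return out
```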
Demographic Parity
Demographic parity or statistical parity (also referred to as acceptance rate parity or benchmarking) is the property of a classifier whereby subjects in the protected and unprotected groups have an equal probability of being assigned to the positive predicted class. This metric considers only the predicted outcome, not the actual outcome:

P(R = + | A = a) = P(R = + | A = b)  for all groups a, b,

where A is the sensitive feature and R is the binary prediction {+, −}.
Use Cases
- Hiring algorithms to ensure equal hiring rates across genders
- Loan approval systems to provide equal approval rates across different ethnicities
Limitations
- May not account for differences in group qualifications or characteristics
- Can lead to reverse discrimination if strictly enforced
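Fairlearn (discussed below) exposes this metric directly. A minimal sketch, assuming fairlearn is installed and using illustrative arrays:

```python
import numpy as np
from fairlearn.metrics import demographic_parity_difference

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])  # true labels (not used by the parity criterion itself)
y_pred = np.array([1, 0, 0, 1, 1, 0, 1, 0])  # model predictions
sex    = np.array(["f", "f", "f", "f", "m", "m", "m", "m"])

# Difference in selection rate P(R=+) between groups; 0 means exact demographic parity.
print(demographic_parity_difference(y_true, y_pred, sensitive_features=sex))
```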
Treatment equality
Treatment equality focuses on balancing the ratio of false positives to false negatives across different groups. A classifier satisfies this definition if the subjects in the protected and unprotected groups have an equal ratio of FN and FP, satisfying the formula:

FN_a / FP_a = FN_b / FP_b  for all groups a, b,

where FN and FP are the counts of false negatives and false positives within each group.
Use Cases
- Predictive policing to balance false arrest rates and failure to arrest rates across different communities
- Fraud detection systems to balance the rates of false alarms and missed frauds across different customer segments.
Limitations
- Complex to calculate and interpret
- May lead to trade-offs with overall model accuracy
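The FN/FP ratio per group can be computed directly from the confusion counts. A minimal sketch, with illustrative names:

```python
import numpy as np

def fn_fp_ratio_by_group(y_true, y_pred, groups):
    """Ratio of false negatives to false positives per group (treatment equality)."""
    out = {}
    for g in np.unique(groups):
        m = groups == g
        fn = np.sum((y_true[m] == 1) & (y_pred[m] == 0))
        fp = np.sum((y_true[m] == 0) & (y_pred[m] == 1))
        out[g] = fn / fp if fp else np.inf
    return out
```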
Mitigation of bias
- Fairness through unawareness (FTU) (or anti-classification). There is a very intuitive approach to fairness, usually called fairness through unawareness (FTU) or blindness, which prescribes not to explicitly employ sensitive features when making (automated) decisions. This is effectively a notion of individual fairness, since two individuals differing only in the values of their sensitive attributes would receive the same outcome. However, FTU has several drawbacks, the main one being that it does not account for possible correlations between sensitive attributes and the non-sensitive attributes employed in the decision-making process. For example, an agent with the (malignant) intention to discriminate on the basis of gender could introduce into the model a proxy variable for gender (i.e. a variable highly correlated with gender) and thereby effectively use gender information while remaining compliant with the FTU prescription (see the sketch below).
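The proxy problem is easy to demonstrate: dropping the sensitive column does not remove its statistical footprint. A minimal sketch with synthetic data (all names here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
gender = rng.integers(0, 2, n)            # sensitive attribute, dropped under FTU
proxy  = gender + rng.normal(0, 0.3, n)   # "neutral" feature highly correlated with gender

# FTU removes `gender` from the model inputs, but `proxy` still carries the signal:
print(np.corrcoef(gender, proxy)[0, 1])   # ~0.86, so gender remains recoverable
```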
Bias mitigation in classification
Fairness
Libraries
Fairlearn
Fairlearn is an open-source Python library developed by Microsoft. It provides tools for assessing and mitigating unfairness in machine learning models. Fairlearn offers both fairness metrics and algorithms for reducing bias. It also includes visualization tools to help interpret fairness metrics and mitigation results.
- Homepage
- Fairness in machine learning
- Fairlearn's GitHub
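A minimal assess-then-mitigate sketch, assuming fairlearn and scikit-learn are installed; the synthetic data and variable names are illustrative:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from fairlearn.metrics import MetricFrame, selection_rate
from fairlearn.reductions import ExponentiatedGradient, DemographicParity

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
sex = rng.integers(0, 2, 500)
y = (X[:, 0] + 0.5 * sex + rng.normal(0, 1, 500) > 0).astype(int)

# Assess: selection rate per group for an unconstrained model
clf = LogisticRegression().fit(X, y)
mf = MetricFrame(metrics=selection_rate, y_true=y,
                 y_pred=clf.predict(X), sensitive_features=sex)
print(mf.by_group)

# Mitigate: retrain under a demographic parity constraint
mitigator = ExponentiatedGradient(LogisticRegression(), constraints=DemographicParity())
mitigator.fit(X, y, sensitive_features=sex)
print(MetricFrame(metrics=selection_rate, y_true=y,
                  y_pred=mitigator.predict(X), sensitive_features=sex).by_group)
```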
AIF360
AIF360 (AI Fairness 360) is an open-source toolkit developed by IBM. It aims to help detect and mitigate bias in AI models through a comprehensive set of fairness metrics and bias mitigation algorithms. It’s known for its extensive documentation and tutorials to guide you through the process of fairness assessment.
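A minimal sketch of AIF360's assess-then-reweigh pattern, assuming aif360 and pandas are installed; the toy dataset and column names are illustrative:

```python
import pandas as pd
from aif360.datasets import BinaryLabelDataset
from aif360.metrics import BinaryLabelDatasetMetric
from aif360.algorithms.preprocessing import Reweighing

# Toy dataset: one feature, a protected attribute, and a binary label
df = pd.DataFrame({
    "feature": [0.2, 0.8, 0.5, 0.9, 0.1, 0.7],
    "sex":     [0,   0,   0,   1,   1,   1],
    "label":   [0,   1,   0,   1,   1,   1],
})
ds = BinaryLabelDataset(df=df, label_names=["label"],
                        protected_attribute_names=["sex"])

priv, unpriv = [{"sex": 1}], [{"sex": 0}]
metric = BinaryLabelDatasetMetric(ds, privileged_groups=priv, unprivileged_groups=unpriv)
print(metric.disparate_impact())   # ratio of positive rates, unprivileged / privileged

# Pre-processing mitigation: reweight instances to break the label/group association
ds_fair = Reweighing(unprivileged_groups=unpriv, privileged_groups=priv).fit_transform(ds)
print(ds_fair.instance_weights)    # weights to pass to any standard classifier
```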
Themis-ML
Themis-ML is an open-source library focused on fairness-aware machine learning. It provides tools for implementing and evaluating fairness metrics and algorithms, particularly for binary classification tasks. It’s also designed to integrate easily with existing machine learning workflows using scikit-learn.
Fairness Indicators
Fairness Indicators is a library developed by Google to help assess the fairness of machine learning models. It provides tools for evaluating fairness metrics and visualizing disparities across different demographic groups. It’s designed to work seamlessly with TensorFlow Extended and TensorFlow Model Analysis.
FairComp
FairComp is an open-source library for comparing different fairness interventions and metrics. It provides a standardized framework for evaluating the trade-offs between fairness and accuracy. It also enables benchmarking of models against various fairness metrics.
- Gender Equitable AI Toolkit. An iterative e-learning toolkit designed by the NGO sector for the NGO sector to enable the implementation of gender equitable AI / ML solutions
- Aequitas. Aequitas is an open-source bias audit toolkit for machine learning developers, analysts, and policymakers to audit machine learning models for discrimination and bias, and to make informed and equitable decisions around developing and deploying predictive risk-assessment tools.
LLMs
The specific page for RAI in LLMs can be found here: RAI LLM.
Emerging topics
- Fairness-Aware AI Development: Increasing focus on integrating fairness considerations throughout the AI development lifecycle, from data collection to model deployment
- Explainable AI (XAI): Development of techniques to make AI models more interpretable and transparent, helping stakeholders understand how decisions are made and identify potential biases
- Intersectional Fairness: Research on addressing intersectional biases that affect individuals belonging to multiple marginalized groups, going beyond single-axis fairness metrics
- Algorithmic Auditing: Growth of third-party auditing practices to assess and certify the fairness of AI systems, providing external validation and accountability
- Collaborative Fairness: Collaboration between AI developers, social scientists, and ethicists to create more holistic and socially aware AI systems
- Bias Mitigation Technologies: Advancements in bias mitigation techniques, including new algorithms and tools that reduce bias while maintaining model performance
Other readings
To be reviewed
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3547922
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3792772
https://fairware.cs.umass.edu/papers/Verma.pdf
https://www.holisticai.com/blog/holistic-ai-library-tutorial
https://arxiv.org/pdf/1801.07593.pdf