This page is part of the Responsible AI series:
- Introduction
- List of bias
- Bias Metrics
Introduction
Bias and fairness are critical considerations when it comes to artificial intelligence (AI) systems. In the context of AI, bias refers to the systematic errors or deviations from an accurate or fair outcome that can occur in the development and deployment of AI algorithms.
Bias
Bias can manifest in different forms, with statistical bias and ethics bias being two distinct aspects:
- Statistical bias, often referred to as algorithmic bias, is a type of bias that arises from the data used to train machine learning models. If the training data contains inherent biases or reflects historical inequalities, the resulting AI systems can perpetuate and amplify those biases. For example, if a hiring algorithm is trained on biased historical hiring data, it may inadvertently discriminate against certain demographic groups when making hiring recommendations.
- Ethics bias refers to the biases that arise from the ethical judgments and values embedded in the design, development, and decision-making processes of AI systems. This type of bias can emerge when the objectives or criteria used to optimize AI algorithms reflect certain societal or individual biases. For instance, if an AI system is designed to maximize profit without considering potential social or environmental impacts, it may prioritize short-term financial gains over long-term societal well-being.
Recognizing and addressing both statistical bias and ethics bias is essential for ensuring fairness in AI systems. Mitigating statistical bias requires careful data preprocessing, ensuring representative and diverse training datasets, and employing techniques such as bias correction and fairness-aware learning. Ethics bias, on the other hand, calls for critical reflection, ethical frameworks, and stakeholder engagement throughout the AI development lifecycle to identify and challenge the underlying assumptions and values shaping the system's behavior.
Achieving fairness in AI is a multifaceted endeavor that requires a holistic approach. It involves a combination of technical solutions, regulatory measures, and ethical considerations to promote transparency, accountability, and inclusivity. By addressing statistical bias and ethics bias, we can strive towards AI systems that treat individuals equitably, avoid discrimination, and contribute to a more just and unbiased society.
List of Bias
- An “Unbiased” Guide to Bias in AI: differences between statistical bias and ethics bias; ethical bias by proxy; ethical bias through historically biased decisions; ethical bias through an inappropriate choice of target.
In more detail:

- Cognitive Bias
- Inductive Bias
- Technical Bias
- Omitted Variable Bias
- Bias-variance tradeoff
Bias metrics
Below is a list of metrics used to detect bias and to constrain a system to be "fair" according to these metrics.
Equal performance refers to the assurance that a model is equally accurate for individuals in the protected and non-protected groups. Equal performance has three commonly discussed forms:
- equal sensitivity (also known as equal opportunity),
- equal sensitivity and specificity, i.e. equal true positive rate and equal false positive rate (also known as equalized odds),
- equal positive predictive value (commonly referred to as predictive parity).
Equalized Odds
P(R = + | Y = y, A = a) = P(R = + | Y = y, A = b)  for all y ∈ {+, −} and all groups a, b,

where A is the sensitive feature, Y is the true label, and R is the binary prediction {+, −}. Equivalently, the true positive rate and the false positive rate are equal across groups. The concept was originally defined for binary-valued Y, but in 2017, Woodworth et al. generalized it to multiple classes.
Use Cases
- Criminal justice systems to ensure equal false positive and true positive rates across different races.
- Medical diagnosis systems to ensure equal error rates across genders.
Limitations
- Difficult to achieve in practice, as it requires balancing multiple rates
- May conflict with other fairness criteria and overall accuracy
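As a concrete illustration, equalized odds can be checked by comparing the true positive rate and the false positive rate per group. Below is a minimal sketch in plain NumPy, assuming binary labels and predictions encoded as 0/1; all variable names and data are illustrative:

```python
import numpy as np

def rates_by_group(y_true, y_pred, groups):
    """Return (TPR, FPR) per group; equalized odds holds when both match across groups."""
    out = {}
    for g in np.unique(groups):
        m = groups == g
        yt, yp = y_true[m], y_pred[m]
        tpr = np.mean(yp[yt == 1]) if np.any(yt == 1) else np.nan  # P(R=+ | Y=+, A=g)
        fpr = np.mean(yp[yt == 0]) if np.any(yt == 0) else np.nan  # P(R=+ | Y=-, A=g)
        out[g] = (tpr, fpr)
    return out

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 0, 1, 1, 0, 1, 0])
groups = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])
print(rates_by_group(y_true, y_pred, groups))
```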
Predictive Parity
P(Y = + | R = +, A = a) = P(Y = + | R = +, A = b)  for all groups a, b,

where A is the sensitive feature, Y is the true label, and R is the binary prediction {+, −}. Predictive parity ensures that positive predictions have the same precision across different groups.
Use Cases
- Loan default predictions to ensure equal precision across different demographic groups
- Healthcare treatment recommendations to ensure equal accuracy across different patient groups
Limitations
- May not address underlying disparities in data distribution
- Can conflict with other fairness metrics like equalized odds
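Analogously, predictive parity can be checked by comparing precision (positive predictive value) per group. A minimal NumPy sketch, with illustrative names:

```python
import numpy as np

def precision_by_group(y_true, y_pred, groups):
    """Precision P(Y=+ | R=+, A=g) per group; predictive parity holds when these match."""
    out = {}
    for g in np.unique(groups):
        m = (groups == g) & (y_pred == 1)            # positive predictions in group g
        out[g] = np.mean(y_true[m]) if np.any(m) else np.nan
    return out
```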
Demographic Parity
Demographic parity or statistical parity (also referred to as acceptance rate parity or benchmarking) is the property of a classifier whereby subjects in the protected and unprotected groups have an equal probability of being assigned to the positive predicted class. This metric considers only the predicted outcome, not the actual outcome:

P(R = + | A = a) = P(R = + | A = b)  for all groups a, b,

where A is the sensitive feature and R is the binary prediction {+, −}.
Use Cases
- Hiring algorithms to ensure equal hiring rates across genders
- Loan approval systems to provide equal approval rates across different ethnicities
Limitations
- May not account for differences in group qualifications or characteristics
- Can lead to reverse discrimination if strictly enforced
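Fairlearn (discussed below) exposes this metric directly. A minimal sketch, assuming fairlearn is installed and using illustrative arrays:

```python
import numpy as np
from fairlearn.metrics import demographic_parity_difference

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])  # true labels (not used by the parity criterion itself)
y_pred = np.array([1, 0, 0, 1, 1, 0, 1, 0])  # model predictions
sex    = np.array(["f", "f", "f", "f", "m", "m", "m", "m"])

# Difference in selection rate P(R=+) between groups; 0 means exact demographic parity.
print(demographic_parity_difference(y_true, y_pred, sensitive_features=sex))
```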
Treatment equality
Treatment equality focuses on balancing the ratio of false positives to false negatives across different groups. A classifier satisfies this definition if the subjects in the protected and unprotected groups have an equal ratio of FN and FP, satisfying the formula:

FN_a / FP_a = FN_b / FP_b  for all groups a, b,

where FN and FP are the counts of false negatives and false positives within each group.
Use Cases
- Predictive policing to balance false arrest rates and failure to arrest rates across different communities
- Fraud detection systems to balance the rates of false alarms and missed frauds across different customer segments.
Limitations
- Complex to calculate and interpret
- May lead to trade-offs with overall model accuracy
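The FN/FP ratio per group can be computed directly from the confusion counts. A minimal sketch, with illustrative names:

```python
import numpy as np

def fn_fp_ratio_by_group(y_true, y_pred, groups):
    """Ratio of false negatives to false positives per group (treatment equality)."""
    out = {}
    for g in np.unique(groups):
        m = groups == g
        fn = np.sum((y_true[m] == 1) & (y_pred[m] == 0))
        fp = np.sum((y_true[m] == 0) & (y_pred[m] == 1))
        out[g] = fn / fp if fp else np.inf
    return out
```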
Mitigation of bias
- Fairness through unawareness (FTU) (or anti-classification). There is a very intuitive approach to fairness, usually called fairness through unawareness (FTU) or blindness, which prescribes not to explicitly employ sensitive features when making (automated) decisions. This is effectively a notion of individual fairness, since two individuals differing only in the values of their sensitive attributes would receive the same outcome. However, FTU has several drawbacks, the main one being that it does not account for possible correlations between sensitive attributes and the non-sensitive attributes employed in the decision-making process. For example, an agent with the (malignant) intention to discriminate on the basis of gender could introduce into the model a proxy variable for gender (i.e. a variable highly correlated with gender) and thereby effectively use gender information while remaining compliant with the FTU prescription (see the sketch below).
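The proxy problem is easy to demonstrate: dropping the sensitive column does not remove its statistical footprint. A minimal sketch with synthetic data (all names here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
gender = rng.integers(0, 2, n)            # sensitive attribute, dropped under FTU
proxy  = gender + rng.normal(0, 0.3, n)   # "neutral" feature highly correlated with gender

# FTU removes `gender` from the model inputs, but `proxy` still carries the signal:
print(np.corrcoef(gender, proxy)[0, 1])   # ~0.86, so gender remains recoverable
```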
Bias mitigation in classification
Fairness
Libraries
Fairlearn
Fairlearn is an open-source Python library developed by Microsoft. It provides tools for assessing and mitigating unfairness in machine learning models. Fairlearn offers both fairness metrics and algorithms for reducing bias. It also includes visualization tools to help interpret fairness metrics and mitigation results.
- Homepage
- Fairness in machine learning
- Fairlearn's GitHub
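A minimal assess-then-mitigate sketch, assuming fairlearn and scikit-learn are installed; the synthetic data and variable names are illustrative:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from fairlearn.metrics import MetricFrame, selection_rate
from fairlearn.reductions import ExponentiatedGradient, DemographicParity

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
sex = rng.integers(0, 2, 500)
y = (X[:, 0] + 0.5 * sex + rng.normal(0, 1, 500) > 0).astype(int)

# Assess: selection rate per group for an unconstrained model
clf = LogisticRegression().fit(X, y)
mf = MetricFrame(metrics=selection_rate, y_true=y,
                 y_pred=clf.predict(X), sensitive_features=sex)
print(mf.by_group)

# Mitigate: retrain under a demographic parity constraint
mitigator = ExponentiatedGradient(LogisticRegression(), constraints=DemographicParity())
mitigator.fit(X, y, sensitive_features=sex)
print(MetricFrame(metrics=selection_rate, y_true=y,
                  y_pred=mitigator.predict(X), sensitive_features=sex).by_group)
```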
AIF360
AIF360 (AI Fairness 360) is an open-source toolkit developed by IBM. It aims to help detect and mitigate bias in AI models through a comprehensive set of fairness metrics and bias mitigation algorithms. It’s known for its extensive documentation and tutorials to guide you through the process of fairness assessment.
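A minimal sketch of AIF360's assess-then-reweigh pattern, assuming aif360 and pandas are installed; the toy dataset and column names are illustrative:

```python
import pandas as pd
from aif360.datasets import BinaryLabelDataset
from aif360.metrics import BinaryLabelDatasetMetric
from aif360.algorithms.preprocessing import Reweighing

# Toy dataset: one feature, a protected attribute, and a binary label
df = pd.DataFrame({
    "feature": [0.2, 0.8, 0.5, 0.9, 0.1, 0.7],
    "sex":     [0,   0,   0,   1,   1,   1],
    "label":   [0,   1,   0,   1,   1,   1],
})
ds = BinaryLabelDataset(df=df, label_names=["label"],
                        protected_attribute_names=["sex"])

priv, unpriv = [{"sex": 1}], [{"sex": 0}]
metric = BinaryLabelDatasetMetric(ds, privileged_groups=priv, unprivileged_groups=unpriv)
print(metric.disparate_impact())   # ratio of positive rates, unprivileged / privileged

# Pre-processing mitigation: reweight instances to break the label/group association
ds_fair = Reweighing(unprivileged_groups=unpriv, privileged_groups=priv).fit_transform(ds)
print(ds_fair.instance_weights)    # weights to pass to any standard classifier
```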
Themis-ML
Themis-ML is an open-source library focused on fairness-aware machine learning. It provides tools for implementing and evaluating fairness metrics and algorithms, particularly for binary classification tasks. It’s also designed to integrate easily with existing machine learning workflows using scikit-learn.
Fairness Indicators
Fairness Indicators is a library developed by Google to help assess the fairness of machine learning models. It provides tools for evaluating fairness metrics and visualizing disparities across different demographic groups. It’s designed to work seamlessly with TensorFlow Extended and TensorFlow Model Analysis.
FairComp
FairComp is an open-source library for comparing different fairness interventions and metrics. It provides a standardized framework for evaluating the trade-offs between fairness and accuracy. It also enables benchmarking of models against various fairness metrics.
- Gender Equitable AI Toolkit. An iterative e-learning toolkit designed by the NGO sector for the NGO sector to enable the implementation of gender equitable AI / ML solutions
- Aequitas. Aequitas is an open-source bias audit toolkit for machine learning developers, analysts, and policymakers to audit machine learning models for discrimination and bias, and to make informed and equitable decisions around developing and deploying predictive risk-assessment tools.
LLMs
The specific page for RAI in LLMs can be found here: RAI LLM.
Emerging topics
- Fairness-Aware AI Development: Increasing focus on integrating fairness considerations throughout the AI development lifecycle, from data collection to model deployment
- Explainable AI (XAI): Development of techniques to make AI models more interpretable and transparent, helping stakeholders understand how decisions are made and identify potential biases
- Intersectional Fairness: Research on addressing intersectional biases that affect individuals belonging to multiple marginalized groups, going beyond single-axis fairness metrics
- Algorithmic Auditing: Growth of third-party auditing practices to assess and certify the fairness of AI systems, providing external validation and accountability
- Collaborative Fairness: Collaboration between AI developers, social scientists, and ethicists to create more holistic and socially aware AI systems
- Bias Mitigation Technologies: Advancements in bias mitigation techniques, including new algorithms and tools that reduce bias while maintaining model performance
Other readings
To be reviewed
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3547922
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3792772
https://fairware.cs.umass.edu/papers/Verma.pdf
https://www.holisticai.com/blog/holistic-ai-library-tutorial
https://arxiv.org/pdf/1801.07593.pdf