Automatic Causal Fairness Analysis with LLM-Generated Reporting

arXiv cs.LG / 5/1/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper introduces FairMind, a prototype that automates causal fairness analysis for AutoML pipelines by focusing on fairness at the dataset level.
  • FairMind uses the recently proposed “standard fairness model” assumptions to enable sound evaluation of causal effects via counterfactual queries that account for protected features, confounders, and mediators.
  • After dataset preprocessing, the system computes fairness effects in closed form, then leverages LLMs to generate clear, accurate fairness reports about the detected issues.
  • The approach is demonstrated in a zero-shot setting and is shown (via examples) to offer advantages over doing fairness analysis directly with an LLM.
  • The work also discusses extensions for ordinal protected variables, continuous targets, and new decomposition results to broaden applicability.

Abstract

AutoML, intended as the process of automating the application of machine learning to real-world problems, is a key step for AI popularisation. Most AutoML frameworks are not accounting for the potential lack of fairness in the training data and in the corresponding predictions. We introduce \textsc{FairMind}, a software prototype aiming to automatise fairness analysis at the dataset level. We achieve that by resorting to the assumptions of the \emph{standard fairness model}, recently proposed by Ple\v{c}ko and Bareinboim. This allows for a sound fairness evaluation in terms of causal effects, based on \emph{counterfactual} queries involving the target, possibly confounders and mediators, and the different values of an input feature we regard as \emph{protected}. After the necessary data preprocessing, the tool implements a closed-form computation of the effects. LLMs are consequently exploited to generate accurate reports on the fairness levels detected in the training dataset. We achieve that in a zero-shot setup and show by examples the expected advantages with respect to a direct analysis performed by the LLM. To favour applications, extensions to ordinal protected variable and continuous targets and novel decomposition results are also discussed.

Automatic Causal Fairness Analysis with LLM-Generated Reporting | AI Navigate