Online Learning with Multiple Fairness Regularizers via Graph-Structured Feedback

cs.LG arXiv:2508.14311
View PDF arXiv JSON

Abstract

There is an increasing need to enforce multiple, often competing, measures of fairness within automated decision systems. The appropriate weighting of these fairness objectives is typically unknown a priori, may change over time and, in our setting, must be learned adaptively through sequential interactions. In this work, we address this challenge in a bandit setting, where decisions are made with graph-structured feedback.

PDF Viewer