Poster
Learning With Multi-Group Guarantees For Clusterable Subpopulations
Jessica Dai · Nika Haghtalab · Eric Zhao
West Exhibition Hall B2-B3 #W-1017
One common goal for designing algorithms that make predictions is for those algorithms to perform well not just on average over the whole population, but also for meaningful subsets of that population. For example, the algorithm should be equally accurate across gender, race, and their intersections. In this work, we explore an alternative way to define subgroups from a population. Rather than using static, pre-defined identity categories (like gender and race), can we provide performance guarantees on subgroups that we learn directly from the relevant data? We study this question from a theoretical perspective and answer in the affirmative, showing an algorithmic strategy that, surprisingly, can perform almost as well as an algorithm that had pre-defined the groups ahead of time.