Poster
Collaborative Mean Estimation Among Heterogeneous Strategic Agents: Individual Rationality, Fairness, and Truthful Contribution
Alex Clinton · Yiding Chen · Jerry Zhu · Kirthevasan Kandasamy
West Exhibition Hall B2-B3 #W-815
Machine learning has increased the value of data, which is often costly to collect but easyto share. While most existing data sharing platforms assume honesty, participantsmay lie about their contributions to gain access to others’ data.When participants have different data collection costs there are two key challenges indesigning truthful data sharing algorithms. The first is creating a method to validate thesubmission of each contributor using the other participants’ data. The second, is ensuringthere is enough data from all participants so that each agent’s submission can be sufficientlyvalidated against the others, without compromising on efficiency.We address these problem in two parts. First, we determine how to fairly divide thework of data collection. Second, we reward participants based on the quality of the datathey submitted. For each participant we compare the mean of their data to the mean ofthe others’ data. Instead of returning the others’ data to them, we first corrupt it based onthe difference of the means. If a participant wants to receive the others’ data with minimalcorruption, it is in their best interest to collect a sufficient amount of data and share ittruthfully.