Skip to yearly menu bar Skip to main content


Poster

Optimal Survey Design for Private Mean Estimation

Yu-Wei Chen · Raghu Pasupathy · Jordan A Awan

East Exhibition Hall A-B #E-603
[ ] [ ]
Wed 16 Jul 11 a.m. PDT — 1:30 p.m. PDT

Abstract:

This work identifies the first privacy-aware stratified sampling scheme that minimizes the variance for general private mean estimation under the Laplace, Discrete Laplace (DLap) and Truncated-Uniform-Laplace (TuLap) mechanisms within the framework of differential privacy (DP). We view stratified sampling as a subsampling operation, which amplifies the privacy guarantee; however, to have the same final privacy guarantee for each group, different nominal privacy budgets need to be used depending on the subsampling rate. Ignoring the effect of DP, traditional stratified sampling strategies risk significant variance inflation. We phrase our optimal survey design as an optimization problem, where we determine the optimal subsampling sizes for each group with the goal of minimizing the variance of the resulting estimator. We establish strong convexity of the variance objective, propose an efficient algorithm to identify the integer-optimal design, and offer insights on the structure of the optimal design.

Lay Summary:

This work develops a new survey method for collecting social data that protects individual privacy while ensuring the most accurate results. By carefully choosing how many responses to gather from each group of people, the method provides an optimal estimate of the private mean.

Chat is not available.