Poster
Multiple-policy Evaluation via Density Estimation
Yilei Chen · Aldo Pacchiano · Ioannis Paschalidis
West Exhibition Hall B2-B3 #W-1011
We study the problem of evaluating the performance of multiple decision-making strategies (policies), known as multiple-policy evaluation. Evaluating a policy typically requires collecting data that reflects its behavior. A naive approach for multiple-policy evaluation is to evaluate each policy independently, which costs lots of data, as data collected for one policy cannot be reused for others.We propose a new algorithm designed to evaluate multiple policies more efficiently by leveraging the potential similarity between the policies. The algorithm operates in two phases: First, it quickly computes rough estimates of how each policy behaves using a small number of samples. Then, based on the rough estimates, we can compute an optimal strategy to collect data which will be used to evaluate the performance of all policies simultaneously.Our method reduces the data required compared to independent evaluation, especially when the number of policies is large. We provide the rigorous theoretical results for the multiple-policy evaluation problem which may also be of interest in broader contexts. In practice, our method can significantly lower costs in applications like robotics or healthcare where trying out different strategies is expensive.