Poster
Average Certified Radius is a Poor Metric for Randomized Smoothing
Chenhao Sun · Yuhao Mao · Mark Müller · Martin Vechev
East Exhibition Hall A-B #E-2206
Artificial intelligence (AI) is becoming increasingly common in various areas of our lives, making it important to ensure these AI systems are trustworthy and reliable. Randomized smoothing is one of the most popular methods used to make AI models more trustworthy. However, this paper identifies an important and widely spread problem in the evaluation of randomized smoothing. We find that one widely used evaluation metric allows the neural network to focus only on easy data and ignore hard ones. This could lead to critical problems such as unfairness. We show this problem both theoretically and empirically, with extensive evidence demonstrating that this problem has introduced strong selection bias into the development of the algorithms. To address this issue, we suggest alternative metrics to replace the current one. This will make the evaluation of the future development of trustworthy AI more reliable.