Poster
Principled Algorithms for Optimizing Generalized Metrics in Binary Classification
Anqi Mao · Mehryar Mohri · Yutao Zhong
East Exhibition Hall A-B #E-1404
In many real-world situations, it's much more important to get certain predictions right than others. For example, in medical diagnosis, failing to detect a disease (a "false negative") is often far worse than mistakenly flagging a healthy person for more tests (a "false positive"). Similarly, when searching for a rare but important piece of information, we want to be sure we find it, even if it means we get some irrelevant results along the way. Standard methods for training machine learning models often aren't designed for these scenarios where mistakes have unequal consequences or when one type of data is much rarer than another. They treat all errors as equally bad, which can lead to poor performance on the tasks we actually care about.Our research introduces a new, more flexible way to train machine learning models that can be directly tailored to these specific, real-world needs. Instead of using a one-size-fits-all approach, we've developed a method that allows us to define what a "good" outcome looks like for a particular problem and then directly teach the learning algorithm to optimize for that goal. This avoids a common two-step process where a standard model is first trained and then tweaked, which can be inefficient and unreliable.We call our new method METRO (Metric Optimization). We've proven mathematically that this approach is reliable and effective. In experiments, models trained with METRO outperformed existing methods on these specialized tasks, showing that our technique is a powerful tool for building more practical and effective machine learning systems.