Poster
Kona: An Efficient Privacy-Preservation Framework for KNN Classification by Communication Optimization
Guopeng Lin · Ruisheng Zhou · Shuyu Chen · Weili Han · Jin Tan · Wenjing Fang · Lei Wang · Tao Wei
East Exhibition Hall A-B #E-3605
Imagine several hospitals each have valuable but sensitive patient data—like medical histories or test results. To improve medical decision-making, they may want to compare a new patient’s case with similar past cases from other hospitals. A common way to do this is using a machine learning method called K-nearest neighbors (KNN), which finds the most similar past cases to help predict outcomes like likely diagnoses. The more relevant data the model can access, the more accurate its predictions become.But here's the problem: due to privacy laws, hospitals can't just share raw patient data with each other.Our work introduces Kona, an efficient system that allows different organizations to use their data together to perform KNN while keeping their data private. What makes Kona special is that it solves a big problem in existing privacy-protecting systems: they often require too much online data transfer, which makes them slow and hard to use.Kona speeds things up in two smart ways. First, it changes how distances between data points are calculated, so no extra communication is needed during that step. Second, it streamlines the process of picking the nearest neighbors, cutting down the number of times systems have to "talk" to each other. When we tested Kona with real-world data, it was up to 3,000 times more efficient in communication and over 200 times faster than the best existing methods—all while keeping data private and getting the same accurate results.