Poster
Scalable Sobolev IPM for Probability Measures on a Graph
Tam Le · Truyen Nguyen · Hideitsu Hino · Kenji Fukumizu
East Exhibition Hall A-B #E-1500
We study a mathematical method called Sobolev IPM, which helps compare two sets of data points in the form of probabilities. This is useful in machine learning, where we often need to compare data distributions, e.g., to tell whether they are similar or not. The Sobolev IPM problem is hard to calculate. Therefore, it has not been used much in real-world applications. We develop a way to make it much easier and faster to compute, especially when the data points in the sets are connected in a graph. More concretely, we connect Sobolev IPM to a simpler type of math, leading to a new version that gives us a fast answer with the new formula. Moreover, it is then possible to use Sobolev IPM even on large datasets. Furthermore, the new version can be used to build even more powerful tools, called kernels for comparing sets of data points. We then test the proposed approach on tasks like classifying documents and analyzing complex shapes, and obtain promising results.