Poster
Approximate Forest Completion and Learning-Augmented Algorithms for Metric Minimum Spanning Trees
Nate Veldt · Thomas Stanley · Benjamin Priest · Trevor Steil · Keita Iwabuchi · T.S. Jayram · Geoffrey Sanders
East Exhibition Hall A-B #E-1709
Finding a minimum spanning tree for a set of data points is a fundamental computational task that can be used, among other applications, for clustering data into similar groups of points. There are already exact algorithms for solving this problem that rely on computing distances between all pairs of points as a first step. However, this approach is too slow for applications that involve massive datasets, especially when relationships between data points are given by complicated distance functions. In our work, we design a fast new approach for finding a spanning tree that does not require computing distances between all pairs of data points. The method starts by connecting some subsets of points with a fast machine learning heuristic. It then finds a near optimal way to connect these pieces together to form a full spanning tree. A key contribution of our work is our theoretical analysis: we show that if the starting point computed by the fast machine learning heuristic overlaps well with an optimal minimum spanning tree, then the spanning tree our algorithm produces is provably close to optimal.