Poster
Dynamic Similarity Graph Construction with Kernel Density Estimation
Steinar Laenen · Peter Macgregor · He Sun
East Exhibition Hall A-B #E-2004
Imagine you have a constantly growing collection of items, like photos or social media posts, and you want to automatically group similar items together. A common way to achieve this is to first figure out how similar each pair of items is, which can be a lot of work if you have many items. This similarity information can be thought of as a network or "graph". The problem is, when new items arrive, recalculating all these similarities and updating the groups from scratch is very slow. A key part of this similarity calculation is quickly estimating how many items are "around" any given item, a process called Kernel Density Estimation (KDE).This research provides a new, much faster way to do this KDE even as new items are added, without redoing all the calculations each time. Building on this, the paper introduces a method to efficiently maintain a simplified version of the full similarity network that still captures the main groups or "clusters". This means we can update our understanding of the groups quickly when new data arrives.This work makes it practical to find and track evolving groups in large, constantly changing datasets. This is useful for scenarios like understanding how communities form in social networks or how topics trend online, by making the process faster and more scalable than previous methods while still finding accurate groupings.