ICML Poster Learning from Sample Stability for Deep Clustering

Poster

Learning from Sample Stability for Deep Clustering

Zhixin Li · Yuheng Jia · Hui LIU · Junhui Hou

East Exhibition Hall A-B #E-1810

[ Abstract ] [ Lay Summary ]

[ Poster] [ OpenReview]

Thu 17 Jul 4:30 p.m. PDT — 7 p.m. PDT

Abstract:

Deep clustering, an unsupervised technique independent of labels, necessitates tailored supervision for model training. Prior methods explore supervision like similarity and pseudo labels, yet overlook individual sample training analysis. Our study correlates sample stability during unsupervised training with clustering accuracy and network memorization on a per-sample basis. Unstable representations across epochs often lead to mispredictions, indicating difficulty in memorization and atypicality. Leveraging these findings, we introduce supervision signals for the first time based on sample stability at the representation level. Our proposed strategy serves as a versatile tool to enhance various deep clustering techniques. Experiments across benchmark datasets showcase that incorporating sample stability into training can improve the performance of deep clustering. The code is available at https://github.com/LZX-001/LFSS.

Lay Summary:

In this paper, we introduced a new approach to grouping data that uses the idea of how consistent the representation of each data point behaves during the learning process. After running many tests, we found that how stable a data point is closely relates to how accurately it can be grouped and how well the model remembers it. Based on these observations, we developed a new method, which takes advantage of data point stability at both the individual and group levels to improve the overall performance of deep learning-based grouping techniques.

Chat is not available.