

Poster in Workshop: Methods and Opportunities at Small Scale (MOSS)

Why Loss Re-weighting Works If You Stop Early: Training Dynamics of Unconstrained Features

Yize Zhao · Christos Thrampoulidis

Keywords: [ Learning dynamics ] [ Unconstrained features model (UFM) ] [ Loss reweighting ] [ Early stopping ] [ Class imbalance ]


Abstract:

The application of loss reweighting in modern deep learning presents a nuanced picture. While it fails to alter the terminal phase of learning in overparameterized deep neural networks (DNNs) trained on high-dimensional datasets, empirical evidence consistently shows that it offers significant benefits early in training. To transparently demonstrate and analyze this phenomenon, we introduce a small-scale model (SSM). The model is specifically designed to abstract away the inherent complexities of both the DNN architecture and the input data, while retaining key information about the structure of imbalance in its spectral components. On the one hand, the SSM reveals how vanilla empirical risk minimization preferentially learns to distinguish majority classes from minorities early in training, thereby delaying the learning of minorities. On the other, it shows that reweighting restores balanced learning dynamics, enabling the simultaneous learning of features associated with both majorities and minorities.
