Poster in Workshop: ES-FoMo III: 3rd Workshop on Efficient Systems for Foundation Models
Q-Adam-mini: Memory-Efficient 8-bit Quantized Optimizer for Large Language Model Training
Yizhou Han · Chaohao Yang · Congliang Chen · Xingjian Wang · Ruoyu Sun
Abstract:
We propose $\textbf{Q-Adam-mini}$, a memory-efficient optimizer for Large Language Model (LLM) training that achieves an $\mathbf{8\times}$ reduction in GPU memory usage while maintaining performance parity with full-precision AdamW. Building upon Adam-mini, which reduces the memory footprint of optimizer states by 50\% compared to AdamW, we further improve memory efficiency through optimizer state quantization. We achieve this by (i) quantizing the first-order momentum ($m$) to $\textbf{INT8}$ and (ii) retaining the second-order momentum ($v$) in $\textbf{FP32}$, which occupies less than 1\% of total memory. However, the embedding layer exhibits weight-norm instability under this scheme. We analyze this issue and address it by applying stochastic rounding to momentum quantization exclusively for the embedding layer. We validate our approach on both pre-training and fine-tuning tasks, with model sizes ranging from 60M to 8B parameters. Our results demonstrate that Q-Adam-mini enables scalable LLM training with limited computational resources. Code is available at: https://github.com/LouisCroix/Q-Adam-mini
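To illustrate the quantization scheme the abstract describes, below is a minimal PyTorch sketch (not the authors' released implementation): the first-order momentum $m$ is stored in INT8 with a symmetric scale, the second-order momentum $v$ stays in FP32, and stochastic rounding is applied only when quantizing the embedding layer's momentum. The function names and the per-tensor scaling choice are illustrative assumptions; see the repository linked above for the actual code.

```python
import torch

def quantize_int8(m_fp32: torch.Tensor, stochastic: bool = False):
    """Quantize a momentum tensor to INT8 with a symmetric per-tensor scale.

    With stochastic=True, values are rounded up with probability equal to the
    fractional part, so the quantization is unbiased in expectation (the
    treatment the abstract reserves for the embedding layer's momentum).
    """
    scale = m_fp32.abs().max().clamp(min=1e-12) / 127.0
    scaled = m_fp32 / scale
    if stochastic:
        floor = scaled.floor()
        frac = scaled - floor
        rounded = floor + (torch.rand_like(frac) < frac).float()
    else:
        rounded = scaled.round()
    q = rounded.clamp(-127, 127).to(torch.int8)
    return q, scale

def dequantize_int8(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    """Recover an FP32 approximation of the momentum from its INT8 storage."""
    return q.float() * scale

# Usage sketch: inside an Adam-mini-style step, m would be dequantized, updated
# with the new gradient, then re-quantized; stochastic=True would be enabled
# only for the embedding layer's parameter group, while v remains in FP32.
m_q, m_scale = quantize_int8(torch.randn(1024), stochastic=True)
m = dequantize_int8(m_q, m_scale)
```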