Skip to yearly menu bar Skip to main content


Poster
in
Workshop: The 1st Workshop on Vector Databases

LEANN: A Low-Storage Vector Index for Personal Devices

Yichuan Wang · Shu Liu · Zhifei Li · Yongji Wu · Ziming Mao · Yilong Zhao · Xiao Yan · Zhiying Xu · Yang Zhou · Ion Stoica · Sewon Min · Matei Zaharia · Joseph E Gonzalez

[ ] [ Project Page ]
Fri 18 Jul 1:45 p.m. PDT — 3 p.m. PDT

Abstract:

Embedding-based search has become increasingly popular in applications such as recommendation and retrieval augmented generation (RAG). Recently, there is growing demand to support these capabilities over personal data stored locally on devices. However, maintaining the necessary data structure associated with the embedding search is often infeasible due to its high storage overhead. For example, indexing 100 GB of raw data requires 150 to 700 GB of storage, making local deployment impractical. Reducing this overhead while maintaining search quality and latency becomes a critical challenge. In this paper, we present LiteANN, a storage-efficient approximate nearest neighbor (ANN) search index optimized for resource-constrained personal devices. LiteANN combines a compact graph-based structure with an efficient on-the-fly recomputation strategy to enable fast and accurate retrieval with minimal storage overhead. Our evaluation shows that LiteANN reduces index size to under 5% of the original raw data – up to 50× smaller than standard indexes – while achieving 90% top-3 recall in under 2 seconds on real-world question-answering benchmarks.

Chat is not available.