Skip to yearly menu bar Skip to main content


Oral
in
Workshop: The 1st Workshop on Vector Databases

Oral 1: The RaBitQ Library

Jianyang Gao · Yutong Gou · Yuexuan Xu · Jifan Shi · Zhonghao Yang · Cheng Long

[ ]
Fri 18 Jul 9:50 a.m. PDT — 10:10 a.m. PDT
 
presentation: The 1st Workshop on Vector Databases
Fri 18 Jul 8:30 a.m. PDT — 5:30 p.m. PDT

Abstract:

High-dimensional vectors serve as a bridge between massive real-world data and AI applications such as large language models (LLMs). Yet, the data is often of large scale and with high dimensionality, which causes high costs in both memory consumption and computation. To reduce both costs, vector quantization has been widely adopted. A recent advance in vector quantization is the RaBitQ method, which supports flexible compression rates and provides asymptotically optimal error bounds for distance estimation. It demonstrates strong empirical performance in terms of space, accuracy and runtime. In this paper, we first provide a comprehensive overview of RaBitQ’s design, insights and theoretical guarantees. We then introduce the RaBitQ Library, which covers some key implementation techniques of RaBitQ and its integration with indices for approximate nearest neighbor search.

Chat is not available.