Poster
On the Power of Learning-Augmented Search Trees
Jingbang Chen · Xinyuan Cao · Alicia Stepin · Li Chen
East Exhibition Hall A-B #E-2009
Search trees are a fundamental tool in how computers store data. In real world applications--like databases or recommendation systems--we often don’t know in advance which items will be accessed most or what the access patterns will look like.Our work presents a new search tree, based on a classical data structure called a "Treap", which uses machine learning predictions to optimize data storage in the search tree. By predicting which items will be accessed more often, the tree allows quicker overall accesses to these items by moving these frequently-referenced elements to "earlier" parts of the data structure.We show that our method achieves \textbf{static optimality}, meaning it performs as well as the best possible tree tailored to the true access frequencies--assuming we had advanced knowledge of this true distribution.This paper generalizes previous work that relied on assumptions about data access patterns and we show how we can achieve similar speedups for a wider pattern of access frequencies, and we maintain strong guarantees even with noisy predictions. We also extend our method to disk-based systems using B-Trees.Finally, we back our theoretical guarantees with experimental results, demonstrating that our learning-augmented search trees consistently outperform traditional data structures in practice across a wide range of patterns.