Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

Read original: arXiv:2402.11354 - Published 7/11/2024 by Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa

🔄

Overview

High-dimensional approximate nearest neighbor search (ANNS) is a crucial challenge in machine learning.
Graph-based methods have emerged as the leading approach for ANNS, outperforming other techniques.
However, existing graph-based ANNS methods rely on heuristic techniques without formal theoretical backing.
This paper introduces a novel approach called PEOs that offers probabilistic guarantees when exploring a node's neighbors in the graph.

Plain English Explanation

When working with large datasets, a common task is to find the data points that are most similar to a given input. This is known as approximate nearest neighbor search (ANNS). This is particularly challenging in high-dimensional spaces, where data points can be represented by many features.

In recent years, researchers have found that graph-based methods are the best approach for solving ANNS problems. These methods build a graph structure to efficiently search for similar data points. However, the techniques used to navigate these graphs have been based on rough rules of thumb rather than rigorous mathematical principles.

This paper presents a new method called PEOs that provides a more principled way to explore the graph and identify which data points are most likely to be similar to the input. PEOs uses probabilistic techniques to estimate which neighboring nodes in the graph should be examined further, rather than relying on heuristics.

The authors show that by using PEOs, they can significantly improve the efficiency of two popular graph-based ANNS methods, HNSW and NSSG. Their experiments demonstrate that PEOs can increase the throughput, or processing speed, of these methods by 60% to 150%.

Technical Explanation

The core problem addressed in this paper is how to efficiently navigate graph-based indexes for approximate nearest neighbor search (ANNS) in high-dimensional spaces. The authors note that while graph-based methods have become the state-of-the-art for ANNS, the routing strategies used in these methods are predominantly heuristic and lack formal theoretical backing.

To address this, the authors formulate the problem as probabilistic routing, where the goal is to identify which neighbors in the graph should be considered for exact distance calculation. They develop two baseline strategies that incorporate locality-sensitive techniques. Building on this, they propose a novel approach called PEOs (Probabilistic Exploration of Neighbors) that efficiently selects the most promising neighbors to explore.

The key insight behind PEOs is that by modeling the probability distribution of distances between data points, the system can make more informed decisions about which nodes to visit during the search process. This allows PEOs to achieve significant efficiency gains compared to leading-edge routing techniques, with throughput improvements of 10-40% on commonly used graph-based ANNS indexes like HNSW and NSSG.

Critical Analysis

The authors present a well-designed and thorough study, with a clear problem statement, novel technical contributions, and extensive experimental evaluation. The PEOs approach offers a principled way to address the routing challenge in graph-based ANNS, going beyond heuristic methods.

One potential limitation is that the authors' analysis is focused on static datasets, while many real-world applications involve dynamic data that is constantly being updated. It would be interesting to see how PEOs performs in such scenarios and how it compares to techniques like CAGRA and DSEE that are designed for dynamic datasets.

Additionally, the paper does not provide much insight into the computational and memory overhead of PEOs compared to the baseline methods. Understanding these tradeoffs would be valuable for practitioners looking to apply these techniques in real-world systems.

Overall, this paper makes a compelling case for the benefits of a probabilistic approach to routing in graph-based ANNS. The PEOs method represents a promising step forward in enhancing the efficiency and performance of these important algorithms.

Conclusion

This paper introduces a novel approach called PEOs that significantly improves the efficiency of graph-based approximate nearest neighbor search (ANNS) algorithms. By modeling the probability distribution of distances between data points, PEOs is able to make more informed decisions about which nodes to explore during the search process, leading to throughput improvements of 10-40% on commonly used ANNS indexes.

The authors' work demonstrates the value of applying rigorous probabilistic techniques to enhance the routing strategies used in state-of-the-art ANNS methods. This research represents an important step forward in addressing the challenge of high-dimensional ANNS, which is a critical problem in many machine learning applications. As the field continues to evolve, techniques like PEOs may help unlock new levels of performance and efficiency in large-scale similarity search systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔄

Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa

Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal challenge in the field of machine learning. In recent years, graph-based methods have emerged as the superior approach to ANNS, establishing a new state of the art. Although various optimizations for graph-based ANNS have been introduced, they predominantly rely on heuristic methods that lack formal theoretical backing. This paper aims to enhance routing within graph-based ANNS by introducing a method that offers a probabilistic guarantee when exploring a node's neighbors in the graph. We formulate the problem as probabilistic routing and develop two baseline strategies by incorporating locality-sensitive techniques. Subsequently, we introduce PEOs, a novel approach that efficiently identifies which neighbors in the graph should be considered for exact distance calculation, thus significantly improving efficiency in practice. Our experiments demonstrate that equipping PEOs can increase throughput on commonly utilized graph indexes (HNSW and NSSG) by a factor of 1.6 to 2.5, and its efficiency consistently outperforms the leading-edge routing technique by 1.1 to 1.4 times.

7/11/2024

RoarGraph: A Projected Bipartite Graph for Efficient Cross-Modal Approximate Nearest Neighbor Search

Meng Chen, Kai Zhang, Zhenying He, Yinan Jing, X. Sean Wang

Approximate Nearest Neighbor Search (ANNS) is a fundamental and critical component in many applications, including recommendation systems and large language model-based applications. With the advancement of multimodal neural models, which transform data from different modalities into a shared high-dimensional space as feature vectors, cross-modal ANNS aims to use the data vector from one modality (e.g., texts) as the query to retrieve the most similar items from another (e.g., images or videos). However, there is an inherent distribution gap between embeddings from different modalities, and cross-modal queries become Out-of-Distribution (OOD) to the base data. Consequently, state-of-the-art ANNS approaches suffer poor performance for OOD workloads. In this paper, we quantitatively analyze the properties of the OOD workloads to gain an understanding of their ANNS efficiency. Unlike single-modal workloads, we reveal OOD queries spatially deviate from base data, and the k-nearest neighbors of an OOD query are distant from each other in the embedding space. The property breaks the assumptions of existing ANNS approaches and mismatches their design for efficient search. With insights from the OOD workloads, we propose pRojected bipartite Graph (RoarGraph), an efficient ANNS graph index built under the guidance of query distribution. Extensive experiments show that RoarGraph significantly outperforms state-of-the-art approaches on modern cross-modal datasets, achieving up to 3.56x faster search speed at a 90% recall rate for OOD queries.

8/20/2024

🧪

CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs

Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang

Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines spanning data mining and artificial intelligence, from information retrieval and computer vision to natural language processing and recommender systems. Data volumes have soared in recent years and the computational cost of an exhaustive exact nearest neighbor search is often prohibitive, necessitating the adoption of approximate techniques. The balanced performance and recall of graph-based approaches have more recently garnered significant attention in ANNS algorithms, however, only a few studies have explored harnessing the power of GPUs and multi-core processors despite the widespread use of massively parallel and general-purpose computing. To bridge this gap, we introduce a novel parallel computing hardware-based proximity graph and search algorithm. By leveraging the high-performance capabilities of modern hardware, our approach achieves remarkable efficiency gains. In particular, our method surpasses existing CPU and GPU-based methods in constructing the proximity graph, demonstrating higher throughput in both large- and small-batch searches while maintaining compatible accuracy. In graph construction time, our method, CAGRA, is 2.2~27x faster than HNSW, which is one of the CPU SOTA implementations. In large-batch query throughput in the 90% to 95% recall range, our method is 33~77x faster than HNSW, and is 3.8~8.8x faster than the SOTA implementations for GPU. For a single query, our method is 3.4~53x faster than HNSW at 95% recall.

7/10/2024

BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU

Karthik V., Saim Khan, Somesh Singh, Harsha Vardhan Simhadri, Jyothi Vedurada

Approximate Nearest Neighbour Search (ANNS) is a subroutine in algorithms routinely employed in information retrieval, pattern recognition, data mining, image processing, and beyond. Recent works have established that graph-based ANNS algorithms are practically more efficient than the other methods proposed in the literature. The growing volume and dimensionality of data necessitates designing scalable techniques for ANNS. To this end, the prior art has explored parallelizing graph-based ANNS on GPU leveraging its massive parallelism. The current state-of-the-art GPU-based ANNS algorithms either (i) require both the dataset and the generated graph index to reside entirely in the GPU memory, or (ii) they partition the dataset into small independent shards, each of which can fit in GPU memory, and perform the search on these shards on the GPU. While the first approach fails to handle large datasets due to the limited memory available on the GPU, the latter delivers poor performance on large datasets due to high data traffic over the low-bandwidth PCIe bus. We introduce BANG, a first-of-its-kind technique for graph-based ANNS on GPU for billion-scale datasets that cannot entirely fit in the GPU memory. BANG stands out by harnessing a compressed form of the dataset on a single GPU to perform distance computations while efficiently accessing the graph index kept on the host memory, enabling efficient ANNS on large graphs within the limited GPU memory. BANG incorporates highly optimized GPU kernels and proceeds in phases that run concurrently on the GPU and CPU. Notably, on the billion-size datasets, we achieve throughputs 40x-200x more than the competing methods for a high recall value of 0.9. Additionally, BANG is the best in cost- and power-efficiency among the competing methods from the recent Billion-Scale Approximate Nearest Neighbour Search Challenge.

7/22/2024