Abstract
We introduce a scheme for optimally allocating multiple bits per hyperplane for Locality Sensitive Hashing (LSH). Existing approaches binarise LSH projections by thresholding at zero yielding a single bit per dimension. We demonstrate that this is a sub-optimal bit allocation approach that can easily destroy the neighbourhood structure in the original feature space. Our proposed method, dubbed Neighbourhood Preserving Quantization (NPQ), assigns multiple bits per hyperplane based upon adaptively learned thresholds. NPQ exploits a pairwise affinity matrix to discretise each dimension such that nearest neighbours in the original feature space fall within the same quantisation thresholds and are therefore assigned identical bits. NPQ is not only applicable to LSH, but can also be applied to any low-dimensional projection scheme. Despite using half the number of hyperplanes, NPQ is shown to improve LSH-based retrieval accuracy by up to 65% compared to the state-of-the-art.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval |
| Place of Publication | New York, NY, USA |
| Publisher | ACM |
| Pages | 1009-1012 |
| Number of pages | 4 |
| ISBN (Print) | 978-1-4503-2034-4 |
| DOIs | |
| Publication status | Published - 2013 |
Keywords / Materials (for Non-textual outputs)
- approximate nearest neighbour search, hamming distance, image retrieval, locality sensitive hashing, manhattan distance
Fingerprint
Dive into the research topics of 'Neighbourhood Preserving Quantisation for LSH'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver