Skip to main navigation Skip to search Skip to main content

Neighbourhood Preserving Quantisation for LSH

Sean Moran, Victor Lavrenko, Miles Osborne

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We introduce a scheme for optimally allocating multiple bits per hyperplane for Locality Sensitive Hashing (LSH). Existing approaches binarise LSH projections by thresholding at zero yielding a single bit per dimension. We demonstrate that this is a sub-optimal bit allocation approach that can easily destroy the neighbourhood structure in the original feature space. Our proposed method, dubbed Neighbourhood Preserving Quantization (NPQ), assigns multiple bits per hyperplane based upon adaptively learned thresholds. NPQ exploits a pairwise affinity matrix to discretise each dimension such that nearest neighbours in the original feature space fall within the same quantisation thresholds and are therefore assigned identical bits. NPQ is not only applicable to LSH, but can also be applied to any low-dimensional projection scheme. Despite using half the number of hyperplanes, NPQ is shown to improve LSH-based retrieval accuracy by up to 65% compared to the state-of-the-art.
Original languageEnglish
Title of host publicationProceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval
Place of PublicationNew York, NY, USA
PublisherACM
Pages1009-1012
Number of pages4
ISBN (Print)978-1-4503-2034-4
DOIs
Publication statusPublished - 2013

Keywords / Materials (for Non-textual outputs)

  • approximate nearest neighbour search, hamming distance, image retrieval, locality sensitive hashing, manhattan distance

Fingerprint

Dive into the research topics of 'Neighbourhood Preserving Quantisation for LSH'. Together they form a unique fingerprint.

Cite this