Cite
[1] A. Andoni, P. Indyk, T. Laarhoven, I. Razenshteyn, e L. Schmidt, «Practical and optimal LSH for angular distance», in Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 1, in NIPS’15. Cambridge, MA, USA: MIT Press, dic. 2015, pp. 1225–1233.
Synthesis
Metadata
FirstAuthor:: Andoni, Alexandr
Author:: Indyk, Piotr
Author:: Laarhoven, Thijs
Author:: Razenshteyn, Ilya
Author:: Schmidt, Ludwig
~
Title:: Practical and optimal LSH for angular distance
Year:: 2015
Citekey:: andoniPracticalOptimalLSH2015a
itemType:: conferencePaper
Publisher:: MIT Press
Location:: Cambridge, MA, USA
Pages:: 1225–1233
LINK
Abstract
We show the existence of a Locality-Sensitive Hashing (LSH) family for the angular distance that yields an approximate Near Neighbor Search algorithm with the asymptotically optimal running time exponent. Unlike earlier algorithms with this property (e.g., Spherical LSH [1, 2]), our algorithm is also practical, improving upon the well-studied hyperplane LSH [3] in practice. We also introduce a multiprobe version of this algorithm and conduct an experimental evaluation on real and synthetic data sets.We complement the above positive results with a fine-grained lower bound for the quality of any LSH family for angular distance. Our lower bound implies that the above LSH family exhibits a trade-off between evaluation time and quality that is close to optimal for a natural class of LSH functions. .
Notes
Bound for collision probability
Suppose that are such that , where . Then
The implication is that the cross-polytope LSH achieves the same bounds as the theoretically optimal LSH for the sphere
Cross-polytope LSH definition
the hash family is defined on a unit sphere
Let be a random matrix with i.i.d Gaussian entries. The function to hash a point is the following.
Compute:
and find the point closest to from , where is the i-th standard basis vector of ..