LIDER: An Efficient High-dimensional Learned Index for Large-scale Dense Passage Retrieval
[article]
2022
arXiv
pre-print
Text retrieval using dense embeddings generated from deep neural models is called "dense passage retrieval". Dense passage retrieval systems normally deploy a deep neural model followed by an approximate nearest neighbor (ANN) search module. The model generates text embeddings, which are then indexed by the ANN module. As the data scale grows, the ANN module unavoidably becomes the efficiency bottleneck, because its time complexity is linear or sublinear in the data scale. An
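To make the retrieval pipeline described in the abstract concrete, here is a minimal sketch of the search step. It is not the LIDER method and not an ANN index: it uses exact brute-force search as a stand-in, which shows why a scan over every embedding costs time linear in the number of passages. The function names (`brute_force_search`, `normalize`, `dot`) and the 3-dimensional vectors are hypothetical; a real system would obtain embeddings from a neural encoder.

```python
import heapq
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length so the inner product equals cosine similarity."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def brute_force_search(query, passages, k):
    """Return the ids of the k passages most similar to the query.

    Every passage embedding is scored once, so the cost grows
    linearly with len(passages).
    """
    q = normalize(query)
    scored = ((dot(q, normalize(vec)), pid) for pid, vec in passages.items())
    return [pid for _, pid in heapq.nlargest(k, scored)]


# Hypothetical embeddings (assumption: made-up values, not model output).
passages = {
    "p1": [0.9, 0.1, 0.0],
    "p2": [0.0, 1.0, 0.0],
    "p3": [0.7, 0.6, 0.1],
}
query = [1.0, 0.2, 0.0]
top2 = brute_force_search(query, passages, k=2)
print(top2)
```

An ANN module replaces the full scan with an index that inspects only a subset of candidates, trading some exactness for speed.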
arXiv:2205.00970v1
fatcat:jp3ckx7u4rd4rm4zqunkxlw4a4