Efficient document retrieval in main memory

Trevor Strohman, W. Bruce Croft
2007 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '07  
Disk access performance is a major bottleneck in traditional information retrieval systems. Compared to system memory, disk bandwidth is poor, and seek times are worse. We circumvent this problem by considering query evaluation strategies in main memory. We show how new accumulator trimming techniques combined with inverted list skipping can produce extremely high performance retrieval systems without resorting to methods that may harm effectiveness. We evaluate our techniques using Galago, a new retrieval system designed for efficient query processing. Our system achieves a 69% improvement in query throughput over previous methods.
doi:10.1145/1277741.1277774 dblp:conf/sigir/StrohmanC07 fatcat:klzctjeugbd5jncruc24onwv3u