Incremental updates of inverted lists for text document retrieval

Anthony Tomasic, Héctor García-Molina, Kurt Shoens
1994 SIGMOD record  
With the proliferation of the world's \information highways" a renewed interest in e cient document indexing techniques has come about. In this paper, the problem of incremental updates of inverted lists is addressed using a new dual-structure index data structure. The index dynamically separates long and short inverted lists and optimizes the retrieval, update, and storage of each t ype of list. To study the behavior of the index, a space of engineering tradeo s which range from optimizing
more » ... te time to optimizing query performance is described. We quantitatively explore this space by using actual data and hardwa r e i n c o m bination with a simulation of an information retrieval system. We then describe the best algorithm for a variety of criteria.
doi:10.1145/191843.191896 fatcat:lfaiujctzfgfbmntadjxwiok2q