Compression, SIMD, and Postings Lists

Andrew Trotman
Proceedings of the 2014 Australasian Document Computing Symposium (ADCS '14), 2014
The three generations of postings list compression strategies (Variable Byte Encoding, Word Aligned Codes, and SIMD Codecs) are examined to test whether each truly represented a generational change; each did. Some weaknesses of the current SIMD-based schemes are identified, and a new scheme, QMX, is introduced to address both space and decoding inefficiencies. Improvements are examined on multiple architectures, and it is shown that different SSE implementations (Intel and AMD) perform differently.
doi:10.1145/2682862.2682870 dblp:conf/adcs/Trotman14 fatcat:ej226lelurhevcv6iyaum4t24m