Suffix tree construction algorithms on modern hardware

Dimitris Tsirogiannis, Nick Koudas
2010 Proceedings of the 13th International Conference on Extending Database Technology - EDBT '10  
Suffix trees are indexing structures that enhance the performance of numerous string processing algorithms. In this paper, we propose cache-conscious suffix tree construction algorithms that are tailored to CMP architectures. The proposed algorithms utilize a novel sample-based cache partitioning algorithm to improve cache performance and exploit on-chip parallelism on CMPs. Furthermore, several compression techniques are applied to effectively trade space for cache performance. Through an extensive experimental evaluation using real text data from different domains, we demonstrate that the algorithms proposed herein exhibit better cache performance than their cache-unaware counterparts and effectively utilize all processing elements, achieving satisfactory speedup.
doi:10.1145/1739041.1739075 dblp:conf/edbt/TsirogiannisK10 fatcat:ij42ym6cabf7fng5usnmof5w2y