INDEXING GAPPED-FACTORS USING A TREE

PIERRE PETERLONGO, JULIEN ALLALI, MARIE-FRANCE SAGOT
2008 International Journal of Foundations of Computer Science  
We present a data structure to index a specific kind of factors, that is of substrings, called gapped-factors. A gapped-factor is a factor containing a gap that is ignored during the indexation. The data structure presented is based on the suffix tree and indexes all the gapped-factors of a text with a fixed size of gap, and only those. The construction of this data structure is done online in linear time and space. Such a data structure may play an important role in various pattern matching
more » ... pattern matching and motif inference problems, for instance in text filtration.
doi:10.1142/s0129054108005541 fatcat:kkdtpzxecjecnbj6ursbhuxhb4