A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is
We present an approach to searching genetic DNA sequences using an adaptation of the suf- x tree data structure deployed on the general purpose persistent J a va platform, PJama. Our implementation technique is novel, in that it allows us to build su x trees on disk for arbitrarily large sequences, for instance for the longest human chromosome consisting of 263 million letters. We propose to use such indexes as an alternative to the current practice of serial scanning. We describe our treedblp:conf/vldb/HuntAI01 fatcat:zx3urj3o45hcfpakr2spefbocy