Efficient clustering of large EST data sets on parallel computers
2003
Nucleic Acids Research
Clustering expressed sequence tags (ESTs) is a powerful strategy for gene identification, gene expression studies and identifying important genetic variations such as single nucleotide polymorphisms. To enable fast clustering of large-scale EST data, we developed PaCE (for Parallel Clustering of ESTs), a software program for EST clustering on parallel computers. In this paper, we report on the design and development of PaCE and its evaluation using Arabidopsis ESTs. The novel features of our
doi:10.1093/nar/gkg379
pmid:12771222
pmcid:PMC156714
fatcat:klms2artobbn3n45dfjfb7jwri