Discrimination of regulatory DNA by SVM on the basis of over- and under-represented motifs

Rene te Boekhorst, Irina I. Abnizova, Lorenz Wernisch
2008 The European Symposium on Artificial Neural Networks  
In this paper we apply three pattern recognition methods (support vector machine, cluster analysis and principal component analysis) to distinguish regulatory regions from coding and non-coding non regulatory DNA sequences. Using a new feature representation (the degree by which motifs are over-and under-represented) we demonstrate the remarkable power of this methodology in identifying regulatory regions of Drosophila melanogaster.
dblp:conf/esann/BoekhorstAW08 fatcat:6elpi3pgkfeybky4ok5c2w4fjm