A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2010; you can also visit the original URL.
The file type is application/pdf
.
Protein Sequence Classification Through Relevant Sequence Mining and Bayes Classifiers
[chapter]
2005
Lecture Notes in Computer Science
We tackle the problem of sequence classification using relevant subsequences found in a dataset of protein labelled sequences. A subsequence is relevant if it is frequent and has a minimal length. For each query sequence a vector of features is obtained. The features consist in the number and average length of the relevant subsequences shared with each of the protein families. Classification is performed by combining these features in a Bayes Classifier. The combination of these characteristics
doi:10.1007/11595014_24
fatcat:5hd3jnflsjgdporlnm5tk2gap4