A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Peptide partitions and protein identification: a computational analysis
[article]
2016
bioRxiv
pre-print
Peptide sequences from a proteome can be partitioned into N mutually exclusive sets and used to identify their parent proteins in a sequence database. This is illustrated with the human proteome (http://www.uniprot.org; id UP000005640), which is partitioned into eight subsets KZ*R, KZ*D, KZ*E, KZ*, Z*R, Z*D, Z*E, and Z*, where Z ϵ {A, N, C, Q, G, H, I, L, M, F, P, S, T, W, Y, V} and Z* ≡ 0 or more occurrences of Z. If the full peptide sequence is known then over 98% of the proteins in the
doi:10.1101/069526
fatcat:o34qqxqi6bbfzoxzupfq7ihoi4