Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins

Simon McGowan, Jonathan Terrett, Clive Brown, Paul Adam, Louise Aldridge, Jason Allen, Bob Amess, Kristian Andrews, Martin Barnes, David Barnwell
2004 Current Proteomics
The identification of protein-coding genes is currently based on the merging of evidence and predictions from a variety of databases that may themselves contain inaccurate and partial information. We have developed a method for mapping accurate interpretations of protein MS-MS data to the genome. This approach enables verification of genes, exons, transcripts and variant transcripts as well as the de novo discovery of novel protein-coding genes. Here we describe improvements in spectral ... tation algorithms, multiple separation techniques, sub-cellular fractionation and novel bioinformatics approaches to characterise more than 14,000 naturally occurring human genes.
doi:10.2174/1570164043488298 fatcat:wwbnwflkonamjhi2mdp5vlhy4m