A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2009; you can also visit the original URL.
The file type is
Comparative genomics of lactic acid bacteria reveals a niche-specific gene set.doi:10.1186/1471-2180-9-50 pmid:19265535 pmcid:PMC2660350 fatcat:luhk2r2zbnesjmjgeko24axyma
doi:10.1016/j.vetimm.2005.12.007 pmid:16476489 fatcat:ogpk3xntubcb3hqjv65yg5vkiq
In the last two years the Pfam database (http://pfam. xfam.org) has undergone a substantial reorganisation to reduce the effort involved in making a release, thereby permitting more frequent releases. Arguably the most significant of these changes is that Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set. Building families on reference proteomes sequences brings greaterdoi:10.1093/nar/gkv1344 pmid:26673716 pmcid:PMC4702930 fatcat:o7rimhdmlrawdjm6ahqj3yfyhe
more »... ty, which decreases the amount of manual curation required to maintain them. It also reduces the number of sequences displayed on the website, whilst still providing access to many important model organisms. Matches to the full UniProtKB database are, however, still available and Pfam annotations for individual UniProtKB sequences can still be retrieved. Some Pfam entries (1.6%) which have no matches to reference proteomes remain; we are working with UniProt to see if sequences from them can be incorporated into reference proteomes. Pfam-B, the automaticallygenerated supplement to Pfam, has been removed. The current release (Pfam 29.0) includes 16 295 entries and 559 clans. The facility to view the relationship between families within a clan has been improved by the introduction of a new tool.
Citation details: Sangrador-Vegas,A., Mitchell,A., Chang,H.-Y. et al. GO annotation in InterPro: why stability does not indicate accuracy in a sea of changing annotations. ...doi:10.1093/database/baw027 pmid:26994912 pmcid:PMC4799721 fatcat:lwve4cepezbcvi6t2k6odynq4u
The InterPro database (http://www.ebi.ac.uk/ interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures,doi:10.1093/nar/gku1243 pmid:25428371 pmcid:PMC4383996 fatcat:4asvzb32obgwhlvdkrfhiokvme
more »... apitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro Figure 1. InterPro matches for UniProtKB entry Q3JCG5 showing predicted protein family membership, domains and sites.
The InterPro database (http://www.ebi.ac.uk/ interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmaticdoi:10.1093/nar/gky1100 pmid:30398656 fatcat:ccbjkaabarhazk3yclk2vjjqoq
more »... nterface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities.
Sangrador-Vegas, et al. ... background band that does not run true to size. 76 [ 2 ] 762 Robert D Finn, Penelope Coggill, Ruth Y Eberhardt, Sean R Eddy, Jaina Mistry, Alex L Mitchell, Simon C Potter, 77 Marco Punta, Matloob Qureshi, Amaia ...doi:10.1101/136077 fatcat:g6qymezotjfexlth4sg4trgzeq
Robert D Finn, Penelope Coggill, Ruth Y Eberhardt, Sean R Eddy, Jaina Mistry, Alex L Mitchell, Simon C Potter, Marco Punta, Matloob Qureshi, Amaia Sangrador-Vegas, et al. ...doi:10.1093/bioinformatics/btx565 pmid:28968636 fatcat:eam4zgsl4zbrxfrw2ahupxopv4
ACKNOWLEDGEMENTS The authors would like to thank former members of the InterPro team: Sara El-Gebali, Matthew Fraser, Aurélien Luciani, Sebastien Pesseat, Simon Potter, Neil Rawlings, Amaia Sangrador-Vegas ...doi:10.1093/nar/gkaa977 pmid:33156333 pmcid:PMC7778928 fatcat:vfs7k3dxr5bcpbpvxbptj6r46m