CDD: a curated Entrez database of conserved domain alignments

A. Marchler-Bauer
2003 Nucleic Acids Research  
The Conserved Domain Database (CDD) is now indexed as a separate database within the Entrez system and linked to other Entrez databases such as MEDLINE 1 . This allows users to search for domain types by name, for example, or to view the domain architecture of any protein in Entrez's sequence database. CDD can be accessed on the WorldWide Web at http://www.ncbi.nlm.nih.gov/entrez/query. fcgi?db ¼ cdd. Users may also employ the CD-Search service to identify conserved domains in new sequences, at
more » ... http://www.ncbi.nlm.nih.gov/ Structure/cdd/wrpsb.cgi. CD-Search results, and pre-computed links from Entrez's protein database, are calculated using the RPS-BLAST algorithm and Position Specific Score Matrices (PSSMs) derived from CDD alignments. CD-Searches are also run by default for protein-protein queries submitted to BLAST 1 at http://www.ncbi.nlm.nih.gov/BLAST. CDD mirrors the publicly available domain alignment collections SMART and PFAM, and now also contains alignment models curated at NCBI. Structure information is used to identify the core substructure likely to be present in all family members, and to produce sequence alignments consistent with structure conservation. This alignment model allows NCBI curators to annotate 'columns' corresponding to functional sites conserved among family members.
doi:10.1093/nar/gkg087 pmid:12520028 pmcid:PMC165534 fatcat:qexucece6reudeilu2swyzjg3i