NCBI's Conserved Domain Database (CDD) is a collection of multiple sequence alignments and derived database search models, which represent protein domains conserved in molecular evolution. The collection can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, and is also part of NCBI's Entrez query and retrieval system, cross-linked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models can be queried with novel protein sequences via the CD-Search service at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. Starting with the latest version of CDD, v2.14, information from redundant and homologous domain models is summarized at a superfamily level, and domain annotation on proteins is flagged as either 'specific' (identifying molecular function with high confidence) or as 'non-specific' (identifying superfamily membership only).
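
The 'specific' versus 'non-specific' flagging and the superfamily-level summary described above can be illustrated with a minimal sketch. It is not CDD's actual implementation. The abstract only says that 'specific' hits identify molecular function with high confidence, so the per-model score cutoff used here is an assumption, and the names `DomainHit`, `classify_hit` and `summarize_by_superfamily` are hypothetical, as are all accessions and scores.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One domain model matched on a protein (all fields are illustrative)."""
    model: str                 # hypothetical domain model identifier
    superfamily: str           # hypothetical superfamily the model belongs to
    bit_score: float           # score of the match
    specific_threshold: float  # ASSUMED per-model cutoff for a 'specific' call


def classify_hit(hit: DomainHit) -> str:
    """Flag a hit as 'specific' if it meets the assumed cutoff."""
    if hit.bit_score >= hit.specific_threshold:
        return "specific"
    return "non-specific"


def summarize_by_superfamily(hits):
    """Keep the best-scoring hit per superfamily and report its flag."""
    best = {}
    for hit in hits:
        current = best.get(hit.superfamily)
        if current is None or hit.bit_score > current.bit_score:
            best[hit.superfamily] = hit
    return {sf: (h.model, classify_hit(h)) for sf, h in best.items()}


# Made-up hits: two redundant models from one superfamily, one from another.
hits = [
    DomainHit("model_A", "superfamily_1", 210.0, 180.0),
    DomainHit("model_B", "superfamily_1", 150.0, 170.0),
    DomainHit("model_C", "superfamily_2", 90.0, 120.0),
]
summary = summarize_by_superfamily(hits)
```

In this toy input, the two redundant models from `superfamily_1` collapse into a single annotation, which is the kind of summarization the abstract describes for homologous domain models.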


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2686570
DOI: http://dx.doi.org/10.1093/nar/gkn845


