A sequence-based foldability score combined with AlphaFold2 predictions to disentangle the protein order/disorder continuum.

Proteins

Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France.

Published: April 2023

Order and disorder govern protein functions, but there is a great diversity in disorder, from regions that are-and stay-fully disordered to conditional order. This diversity is still difficult to decipher even though it is encoded in the amino acid sequences. Here, we developed an analytic Python package, named pyHCA, to estimate the foldability of a protein segment from the only information of its amino acid sequence and based on a measure of its density in regular secondary structures associated with hydrophobic clusters, as defined by the hydrophobic cluster analysis (HCA) approach. The tool was designed by optimizing the separation between foldable segments from databases of disorder (DisProt) and order (SCOPe [soluble domains] and OPM [transmembrane domains]). It allows to specify the ratio between order, embodied by regular secondary structures (either participating in the hydrophobic core of well-folded 3D structures or conditionally formed in intrinsically disordered regions) and disorder. We illustrated the relevance of pyHCA with several examples and applied it to the sequences of the proteomes of 21 species ranging from prokaryotes and archaea to unicellular and multicellular eukaryotes, for which structure models are provided in the AlphaFold protein structure database. Cases of low-confidence scores related to disorder were distinguished from those of sequences that we identified as foldable but are still excluded from accurate modeling by AlphaFold2 due to a lack of sequence homologs or to compositional biases. Overall, our approach is complementary to AlphaFold2, providing guides to map structural innovations through evolutionary processes, at proteome and gene scales.

Download full-text PDF

Source
http://dx.doi.org/10.1002/prot.26441DOI Listing

Publication Analysis

Top Keywords

amino acid
8
regular secondary
8
secondary structures
8
disorder
5
sequence-based foldability
4
foldability score
4
score combined
4
combined alphafold2
4
alphafold2 predictions
4
predictions disentangle
4

Similar Publications

L-valine holds wide-ranging applications in medicine, food, feed, and various industrial sectors. Escherichia coli, a pivotal strain in industrial L-valine production, features a concise fermentation period and a well-defined genetic background. This study focuses on mismatch repair genes (mutH, mutL, mutS, and recG) and genes associated with mutagenesis (dinB, rpoS, rpoD, and recA), employing a high-glucose adaptive culture in conjunction with metabolic modifications to systematically screen for superior phenotypes.

View Article and Find Full Text PDF

Complementary Strategies to Identify Differentially Expressed Genes in the Choroid Plexus of Patients with Progressive Multiple Sclerosis.

Neuroinformatics

January 2025

Laboratory for Applied Genomics and Bioinnovations, Instituto Oswaldo Cruz - Fiocruz, Rio de Janeiro, RJ, Brazil.

Multiple sclerosis (MS) is a neurological disease causing myelin and axon damage through inflammatory and autoimmune processes. Despite affecting millions worldwide, understanding its genetic pathways remains limited. The choroid plexus (ChP) has been studied in neurodegenerative processes and diseases like MS due to its dysregulation, yet its role in MS pathophysiology remains unclear.

View Article and Find Full Text PDF

Rice (Oryza sativa L.), Poaceae family, forms staple diet of half of world's population, and brinjal (Solanum melongena L.), an important solanaceous crop, are consumed worldwide.

View Article and Find Full Text PDF

Soil salinity poses a significant environmental challenge for the growth and development of blueberries. However, the specific mechanisms by which blueberries respond to salt stress are still not fully understood. Here, we employed a comprehensive approach integrating physiological, metabolomic, and transcriptomic analyses to identify key metabolic pathways in blueberries under salt stress.

View Article and Find Full Text PDF

Self-sufficient biocatalytic cascade for the continuous synthesis of danshensu in flow.

Appl Microbiol Biotechnol

January 2025

Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Freiestrasse 3, 3012, Bern, Switzerland.

A new strategy has been developed to successfully produce the active component danshensu ex vivo. For this purpose, phenylalanine dehydrogenase from Bacillus sphaericus was combined with the novel hydroxyphenylpyruvate reductase from Mentha x piperita, thereby providing an in situ cofactor regeneration throughout the conversion process. The purified enzymes were co-immobilized and subsequently employed in batch biotransformation, resulting in 60% conversion of 10 mM L-dopa within 24 h, with a catalytic amount of NAD as cofactor.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!