The LCD-Composer webserver: high-specificity identification and functional analysis of low-complexity domains in proteins.

Bioinformatics

Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO 80523, USA.

Published: December 2022

Summary: Low-complexity domains (LCDs) in proteins are regions enriched in a small subset of amino acids. LCDs exist in all domains of life, often have unusual biophysical behavior, and function in both normal and pathological processes. We recently developed an algorithm to identify LCDs based predominantly on amino acid composition thresholds. Here, we have integrated this algorithm with a webserver and augmented it with additional analysis options. Specifically, users can (i) search for LCDs in whole proteomes by setting minimum composition thresholds for individual or grouped amino acids, (ii) submit a known LCD sequence to search for similar LCDs, (iii) search for and plot LCDs within a single protein, (iv) statistically test for enrichment of LCDs within a user-provided protein set and (v) specifically identify proteins with multiple types of LCDs.
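As a rough illustration of what a composition-threshold search in option (i) might look like, the sketch below slides a fixed-size window along a sequence and reports regions where a chosen residue, or group of residues, meets a minimum fraction. It is not the LCD-Composer implementation: the function name `find_lcds`, its parameters, and the default window size and threshold are all hypothetical choices made for this example, and any additional criteria the published algorithm applies are omitted.

```python
from typing import List, Tuple


def find_lcds(sequence: str, residues: str, window: int = 20,
              min_fraction: float = 0.4) -> List[Tuple[int, int]]:
    """Return merged (start, end) spans (0-based, end-exclusive) covered by
    windows whose fraction of `residues` is at least `min_fraction`.

    Hypothetical helper for illustration; defaults are arbitrary.
    """
    targets = set(residues.upper())  # e.g. "Q" or a group such as "QN"
    seq = sequence.upper()
    if window <= 0 or len(seq) < window:
        return []

    # Count target residues in the first window, then update incrementally.
    count = sum(1 for aa in seq[:window] if aa in targets)
    spans: List[Tuple[int, int]] = []
    for start in range(len(seq) - window + 1):
        if start > 0:
            count -= seq[start - 1] in targets           # residue leaving
            count += seq[start + window - 1] in targets  # residue entering
        if count / window >= min_fraction:
            end = start + window
            if spans and start <= spans[-1][1]:
                # Overlaps or touches the previous span: extend it.
                spans[-1] = (spans[-1][0], end)
            else:
                spans.append((start, end))
    return spans


if __name__ == "__main__":
    demo = "ACDEFGHIKL" + "Q" * 10 + "ACDEFGHIKL"
    print(find_lcds(demo, "Q", window=10, min_fraction=0.8))  # [(8, 22)]
```

Passing several letters in `residues` treats them as one group, which loosely mirrors the "grouped amino acids" option described above.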

Availability and Implementation: The LCD-Composer server can be accessed at http://lcd-composer.bmb.colostate.edu. The corresponding command-line scripts can be accessed at https://github.com/RossLabCSU/LCD-Composer/tree/master/WebserverScripts.
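The summary does not say which statistical test the server uses for option (iv), enrichment of LCDs within a user-provided protein set. One common choice for this kind of question is a one-sided hypergeometric test (equivalent to a one-sided Fisher's exact test), and the sketch below assumes that choice. The function name `enrichment_p_value` and its argument names are hypothetical.

```python
from math import comb


def enrichment_p_value(hits_in_set: int, set_size: int,
                       hits_in_background: int, background_size: int) -> float:
    """One-sided hypergeometric tail probability P(X >= hits_in_set).

    Assumed setup (not taken from the paper): the background proteome has
    `background_size` proteins, `hits_in_background` of which contain an LCD
    of interest; the user's set is treated as `set_size` proteins drawn from
    that background, `hits_in_set` of which contain such an LCD.
    """
    k, n = hits_in_set, set_size
    K, N = hits_in_background, background_size
    if not (0 <= K <= N and 0 <= n <= N and 0 <= k <= min(n, K)):
        raise ValueError("inconsistent counts")

    total = comb(N, n)  # number of ways to draw the set from the background
    # Sum the probability of observing k, k+1, ..., min(n, K) hits.
    tail = sum(comb(K, i) * comb(N - K, n - i)
               for i in range(k, min(n, K) + 1))
    return tail / total


if __name__ == "__main__":
    # Toy numbers only: 2 of 3 submitted proteins have an LCD,
    # versus 4 of 10 in the background.
    print(enrichment_p_value(2, 3, 4, 10))
```

A small p-value here would suggest that LCD-containing proteins are over-represented in the set relative to the background; when many LCD classes are tested at once, a multiple-testing correction would normally be applied on top.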

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9750097
DOI: http://dx.doi.org/10.1093/bioinformatics/btac699

Similar Publications

Intracellular liquid-liquid phase separation (LLPS) of proteins and nucleic acids is a fundamental mechanism by which cells compartmentalize their components and perform essential biological functions. Molecular simulations play a crucial role in providing microscopic insights into the physicochemical processes driving this phenomenon. In this study, we systematically compare six state-of-the-art sequence-dependent residue-resolution models to evaluate their performance in reproducing the phase behaviour and material properties of condensates formed by seven variants of the low-complexity domain (LCD) of the hnRNPA1 protein (A1-LCD), a protein implicated in the pathological liquid-to-solid transition of stress granules.


Background: Automatic classification of arrhythmias based on electrocardiography (ECG) data faces several significant challenges, particularly due to the substantial volume of clinical data involved in ECG signal analysis. The volume of clinical data has increased considerably, especially with the emergence of new clinical symptoms and signs in various arrhythmia conditions. These symptoms and signs, which serve as distinguishing features, can number in the tens of thousands.


Structural plasticity of the coiled-coil interactions in human SFPQ.

Nucleic Acids Res

December 2024

School of Molecular Sciences, The University of Western Australia, 35 Stirling Highway, Crawley, Western Australia 6009, Australia.

The proteins SFPQ (Splicing Factor Proline/Glutamine rich) and NONO (non-POU domain-containing octamer-binding protein) are mammalian members of the Drosophila Behaviour/Human Splicing (DBHS) protein family, which share 76% sequence identity in their conserved 320 amino acid DBHS domain. SFPQ and NONO are involved in all steps of post-transcriptional regulation and are primarily located in mammalian paraspeckles: liquid phase-separated, ribonucleoprotein sub-nuclear bodies templated by NEAT1 long non-coding RNA. A combination of structured and low-complexity regions provides polyvalent interaction interfaces that facilitate homo- and heterodimerisation, polymerisation, interactions with oligonucleotides, mRNA, long non-coding RNA, and liquid phase-separation, all of which have been implicated in cellular homeostasis and neurological diseases including neuroblastoma.


Decoding the biogenesis of HIV-induced CPSF6 puncta and their fusion with the nuclear speckle.

bioRxiv

December 2024

Institut Pasteur, Advanced Molecular Virology Unit, Department of Virology, Université Paris Cité, 75015 Paris, France.

Viruses rely on host cellular machinery for replication. After entering the nucleus, the HIV genome accumulates in nuclear niches where it undergoes reverse transcription and integrates into neighboring chromatin, promoting high transcription rates and new virus progeny. Despite anti-retroviral treatment, viral genomes can persist in these nuclear niches and reactivate if treatment is interrupted, likely contributing to the formation of viral reservoirs.

Article Synopsis
  • The Wnt/β-catenin signaling pathway activation relies on the formation of biomolecular condensates through the polymerization of dishevelled 2 (DVL2).
  • Researchers used biochemical techniques to identify oligomeric DVL2 complexes in human cell lines, uncovering a crucial low-complexity region (LCR4) that influences complex formation.
  • The study reveals that DVL2's activation of the Wnt pathway is dependent on the interaction of specific regions within the protein, highlighting a two-step process for condensate formation involving both high-affinity and low-affinity interaction sites.
