Motivation: Structured Tandem Repeats Proteins (STRPs) constitute a subclass of tandem repeats characterized by repetitive structural motifs. These proteins exhibit distinct secondary structures that form repetitive tertiary arrangements, often resulting in large molecular assemblies. Despite highly variable sequences, STRPs can perform important and diverse biological functions, maintaining a consistent structure with a variable number of repeat units. With the advent of protein structure prediction methods, millions of 3D models of proteins are now publicly available. However, automatic detection of STRPs remains challenging with current state-of-the-art tools due to their lack of accuracy and long execution times, hindering their application on large datasets. In most cases, manual curation remains the most accurate method for detecting and classifying STRPs, making it impracticable to annotate millions of structures.

Results: We introduce STRPsearch, a novel tool for the rapid identification, classification, and mapping of STRPs. Leveraging manually curated entries from RepeatsDB as the known conformational space of STRPs, STRPsearch uses the latest advances in structural alignment for a fast and accurate detection of repeated structural motifs in proteins, followed by an innovative approach to map units and insertions through the generation of TM-score profiles. STRPsearch is highly scalable, efficiently processing large datasets, and can be applied to both experimental structures and predicted models. In addition, it demonstrates superior performance compared to existing tools, offering researchers a reliable and comprehensive solution for STRP analysis across diverse proteomes.

Availability And Implementation: STRPsearch is coded in Python. All scripts and associated documentation are available from: https://github.com/BioComputingUP/STRPsearch.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11645253PMC
http://dx.doi.org/10.1093/bioinformatics/btae690DOI Listing

Publication Analysis

Top Keywords

structured tandem
8
tandem repeats
8
structural motifs
8
motifs proteins
8
large datasets
8
strps
6
strpsearch
5
proteins
5
strpsearch fast
4
fast detection
4

Similar Publications

Analysis of the CHS Gene Family Reveals Its Functional Responses to Hormones, Salinity, and Drought Stress in Moso Bamboo ().

Plants (Basel)

January 2025

State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, School of Life Sciences, Nanjing Forestry University, Nanjing 210037, China.

Chalcone synthase (CHS), the first key structural enzyme in the flavonoid biosynthesis pathway, plays a crucial role in regulating plant responses to abiotic stresses and hormone signaling. However, its molecular functions remain largely unknown in , which is one of the most economically and ecologically important bamboo species and the most widely distributed one in China. This study identified 17 genes in and classified them into seven subgroups, showing a closer evolutionary relationship to genes from rice.

View Article and Find Full Text PDF

A concise, transition metal-free four-step synthetic pathway has been developed for the synthesis of tetracyclic heterosteroidal compounds, 14-aza-12-oxasteroids, starting from readily available 2-naphthol analogues. After conversion of 2-naphthols to 2-naphthylamines by the Bucherer reaction, subsequent selective C-acetylation was achieved via the Sugasawa reaction and reduction of the acetyl group using borohydride, which resulted into the corresponding amino-alcohols. The naphthalene-based amino-alcohols underwent double dehydrations and double intramolecular cyclization with oxo-acids leading to one-pot formation of a C-N bond, a C-O bond and an amide bond in tandem, to generate two additional rings completing the steroidal framework.

View Article and Find Full Text PDF

Conjugation of short-chain fatty acids (SDFAs) to amines containing ring structures allows for better measurement by liquid chromatography tandem mass spectroscopy (LC-MS/MS). However, collision-induced dissociation (CID) results in breaking the conjugate back to the original SCFA and amine. We therefore set out to find an amine that would remain on the SCFA after CID and create a unique daughter for selectivity of measurement.

View Article and Find Full Text PDF

Copepod Lipidomics: Fatty Acid Substituents of Structural Lipids in , a Dominant Species in the Food Chain of the Apalachicola Estuary of the Gulf of Mexico.

Life (Basel)

December 2024

Imaging and Analysis Center, DeBusk College of Osteopathic Medicine, Lincoln Memorial University, 6965 Cumberland Gap Pkwy, Harrogate, TN 37752, USA.

Zooplanktonic copepods represent a major biological mass in the marine food chain that can be affected by climate change. Monitoring the health of this critical biomass is essential for increasing our understanding of the impact of environmental changes on marine environments. Since the lipidomes of marine organisms are known to adapt to alterations in pH, temperature, and availability of metabolic precursors, lipidomics is one technology that can be used for monitoring copepod adaptations.

View Article and Find Full Text PDF

Genome-Wide Identification and Expression Profile of () Gene Family in L.

Int J Mol Sci

January 2025

State Key Laboratory of Tropical Crop Breeding, Sanya Institute, Rubber Research Institute, Chinese Academy of Tropical Agricultural Sciences, Sanya 572025, China.

The biosynthesis of isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), which are essential for sesquiterpenes and triterpenes, respectively, is primarily governed by the mevalonate pathway, wherein () plays a pivotal role. This study identified eight members of the FPS gene family in , designated -, through bioinformatics analysis, revealing their distribution across several chromosomes and a notable tandem gene cluster. The genes exhibited strong hydrophilic properties and key functional motifs crucial for enzyme activity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!