Background: Protein feature extraction plays an important role in the areas of similarity analysis of protein sequences and prediction of protein structures, functions and interactions. The feature extraction based on graphical representation is one of the most effective and efficient ways. However, most existing methods suffer limitations from their method design.
Results: We introduce DCGR, a novel method for extracting features from protein sequences based on the chaos game representation, which is developed by constructing CGR curves of protein sequences according to physicochemical properties of amino acids, followed by converting the CGR curves into multi-dimensional feature vectors by using the distributions of points in CGR images. Tested on five data sets, DCGR was significantly superior to the state-of-the-art feature extraction methods.
Conclusion: The DCGR is practically powerful for extracting effective features from protein sequences, and therefore important in similarity analysis of protein sequences, study of protein-protein interactions and prediction of protein functions. It is freely available at https://sourceforge.net/projects/transcriptomeassembly/files/Feature%20Extraction .
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6587251 | PMC |
http://dx.doi.org/10.1186/s12859-019-2943-x | DOI Listing |
J Vis Exp
January 2025
State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University;
The extent of functional sequences within the human genome is a pivotal yet debated topic in biology. Although high-throughput reverse genetic screens have made strides in exploring this, they often limit their scope to known genomic elements and may introduce non-specific effects. This underscores the urgent need for novel functional genomics tools that enable a deeper, unbiased understanding of genome functionality.
View Article and Find Full Text PDFWorld J Microbiol Biotechnol
January 2025
Graduate School of Science and Technology, Shizuoka University, Shizuoka, Japan.
Marine resources are attractive for screening new useful bacteria. From a marine sediment sample, we performed isolation and screening of bacterial strains in search of new bioactive compounds. HPLC and ESI-MS analysis indicated that the new bacterium, Lysinibacillus sp.
View Article and Find Full Text PDFGenome Biol Evol
January 2025
School of Biological Sciences, Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh EH9 3FL, UK.
Meiosis is generally a fair process: each chromosome has a 50% chance of being included into each gamete. However, meiosis can become aberrant with some chromosomes having a higher chance of making it into gametes than others. Yet, why and how such systems evolve remains unclear.
View Article and Find Full Text PDFAppl Environ Microbiol
January 2025
Joint Degree Program of Kasetsart University and Yamaguchi University, Graduate School of Science and Technology for Innovation, Yamaguchi University, Yamaguchi, Japan.
Unlabelled: Incomplete oxidation of glucose by sp. strain CHM43 produces gluconic acid and then 2- or 5-ketogluconic acid. Although 2-keto-D-gluconate (2KG) is a valuable compound, it is sometimes consumed by itself via an unknown metabolic pathway.
View Article and Find Full Text PDFmBio
January 2025
Department of Microbiology and Immunology, University of Rochester Medical Center, Rochester, New York, USA.
Unlabelled: Pathogenic strains cause cholera using different mechanisms. O1 and O139 serogroup strains use the toxin-co-regulated pilus (TCP) and cholera toxin (CT) for intestinal colonization and to promote secretory diarrhea, while non-O1/non-O139 serogroup strains are typically non-toxigenic and use alternate virulence factors to cause a clinically similar disease. An O39 serogroup, TCP/CT-negative strain, named AM-19226, uses a type III secretion system (T3SS) to translocate more than 10 effector proteins into the host cell cytosol.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!