AI Article Synopsis

  • GIGANTEA (GI) protein in plants is involved in carbon and sucrose metabolism, cell wall deposition, transpiration and hypocotyl elongation, and identifying GI proteins experimentally is resource-intensive.
  • A computational model was developed by comparing ten supervised learning algorithms; the best configuration (SVM using amino acid composition plus physico-chemical properties) reached 96.75% AUC-ROC under five-fold cross-validation.
  • The "GIpred" prediction server makes the model freely available to researchers for proteome-wide identification of GI proteins.

Article Abstract

In plants, the GIGANTEA (GI) protein performs diverse biological functions, including carbon and sucrose metabolism, cell wall deposition, transpiration and hypocotyl elongation, which makes GI an important class of proteins. So far, GI proteins have mostly been identified with resource-intensive experimental methods. In this study, we therefore developed a computational model for fast and accurate prediction of GI proteins. Ten supervised learning algorithms (SVM, RF, JRIP, J48, LMT, IBK, NB, PART, BAGG and LGB) were employed for prediction, with amino acid composition (AAC), FASGAI features and physico-chemical (PHYC) properties used as numerical inputs. Under five-fold cross-validation, the highest accuracies, 96.75% AUC-ROC and 86.7% AUC-PR, were observed for SVM with the AAC + PHYC feature combination. With leave-one-out cross-validation, 97.29% AUC-ROC and 87.89% AUC-PR were achieved. When the model was evaluated on an independent dataset of 18 GI sequences, 17 were correctly predicted. We also performed proteome-wide identification of GI proteins in wheat, followed by functional annotation using Gene Ontology terms. The prediction server "GIpred" is freely accessible at http://cabgrid.res.in:8080/gipred/ for proteome-wide recognition of GI proteins.
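As a rough illustration of two ideas named in the abstract, the sketch below computes an amino acid composition (AAC) vector for a protein sequence and an AUC-ROC score from labels and classifier scores. It is not the GIpred implementation: the function names `amino_acid_composition` and `auc_roc` are hypothetical, the FASGAI and PHYC features and the SVM itself are omitted, and the AUC is computed with the standard pairwise-ranking definition rather than whatever tool the authors used.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so every vector is comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return the fraction of each standard residue, in AMINO_ACIDS order.

    Non-standard symbols (e.g. X, B, Z) are ignored; this is an assumption
    of the sketch, not a documented GIpred behaviour.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def auc_roc(labels, scores):
    """AUC-ROC as the probability that a positive outscores a negative.

    Ties between a positive and a negative score count as half a win.
    Labels are expected to be 1 (positive) or 0 (negative).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Made-up toy sequence and scores, purely for demonstration.
    features = amino_acid_composition("MKTAYIAKQR")
    print(len(features), round(sum(features), 6))
    print(auc_roc([1, 0, 1, 0], [0.9, 0.8, 0.3, 0.1]))
```

In a fuller pipeline, one such 20-dimensional vector per sequence would be combined with other descriptors and passed to a classifier, and the AUC would be averaged over cross-validation folds.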

Supplementary Information: The online version contains supplementary material available at 10.1007/s12298-022-01130-6.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847649
DOI: http://dx.doi.org/10.1007/s12298-022-01130-6

Publication Analysis

Top Keywords

identification proteins: 8
learning algorithms: 8
cross validation: 8
proteins: 5
gipred computational: 4
computational tool: 4
prediction: 4
tool prediction: 4
prediction gigantea: 4
gigantea proteins: 4

