Essential genes identification model based on sequence feature map and graph convolutional neural network.

BMC Genomics

College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China.

Published: January 2024

Background: Essential genes encode functions that play a vital role in the life activities of organisms, encompassing growth, development, immune system functioning, and cell structure maintenance. Conventional experimental techniques for identifying essential genes are resource-intensive and time-consuming, and the accuracy of current machine learning models needs further enhancement. Therefore, it is crucial to develop a robust computational model to accurately predict essential genes.

Results: In this study, we introduce GCNN-SFM, a computational model for identifying essential genes in organisms, based on graph convolutional neural networks (GCNN). GCNN-SFM integrates a graph convolutional layer, a convolutional layer, and a fully connected layer to model and extract features from gene sequences of essential genes. Initially, the gene sequence is transformed into a feature map using coding techniques. Subsequently, a multi-layer GCN is employed to perform graph convolution operations, effectively capturing both local and global features of the gene sequence. Further feature extraction is performed, followed by integrating convolution and fully-connected layers to generate prediction results for essential genes. The gradient descent algorithm is utilized to iteratively update the cross-entropy loss function, thereby enhancing the accuracy of the prediction results. Meanwhile, model parameters are tuned to determine the optimal parameter combination that yields the best prediction performance during training.

Conclusions: Experimental evaluation demonstrates that GCNN-SFM surpasses various advanced essential gene prediction models and achieves an average accuracy of 94.53%. This study presents a novel and effective approach for identifying essential genes, which has significant implications for biology and genomics research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10777564PMC
http://dx.doi.org/10.1186/s12864-024-09958-wDOI Listing

Publication Analysis

Top Keywords

essential genes
28
graph convolutional
12
identifying essential
12
essential
9
sequence feature
8
feature map
8
convolutional neural
8
computational model
8
convolutional layer
8
features gene
8

Similar Publications

Gammaherpesviruses are oncogenic pathogens that establish lifelong infections. There are no FDA-approved vaccines against Epstein-Barr virus or Kaposi sarcoma herpesvirus. Murine gammaherpesvirus-68 (MHV68) infection of mice provides a system for investigating gammaherpesvirus pathogenesis and testing vaccine strategies.

View Article and Find Full Text PDF

Epstein-Barr virus (EBV) and Kaposi's sarcoma-associated herpesvirus (KSHV), which are the only members of the gamma(γ) herpesviruses, are oncogenic viruses that significantly contribute to the development of various human cancers, such as Burkitt's lymphoma, nasopharyngeal carcinoma, Hodgkin's lymphoma, Kaposi's sarcoma, and primary effusion lymphoma. Oncogenesis triggered by γ-herpesviruses involves complex interactions between viral genetics, host cellular mechanisms, and immune evasion strategies. At the genetic level, crucial viral oncogenes participate in the disruption of cell signaling, leading to uncontrolled proliferation and inhibition of apoptosis.

View Article and Find Full Text PDF

During virus infection, the activation of the antiviral endoribonuclease, ribonuclease L (RNase L), by a unique ligand 2'-5'-oilgoadenylate (2-5A) causes the cleavage of single-stranded viral and cellular RNA targets, restricting protein synthesis, activating stress response pathways, and promoting cell death to establish broad antiviral effects. The immunostimulatory dsRNA cleavage products of RNase L activity (RL RNAs) recruit diverse dsRNA sensors to activate signaling pathways to amplify interferon (IFN) production and activate inflammasome, but the sensors that promote cell death are not known. In this study, we found that DEAH-box polypeptide 15 (DHX15) and retinoic acid-inducible gene I (Rig-I) are essential for apoptosis induced by RL RNAs and require mitochondrial antiviral signaling (MAVS), c-Jun amino terminal kinase (JNK), and p38 mitogen-activated protein kinase (p38 MAPK) for caspase-3-mediated intrinsic apoptosis.

View Article and Find Full Text PDF

Successful pollination and fertilization are crucial for grain setting in cereals. Wheat is an allohexaploid autogamous species. Due to its evolutionary history, the genetic diversity of current bread wheat () cultivars is limited.

View Article and Find Full Text PDF

The gene family plays a crucial role in plant growth, development, and responses to biotic and abiotic stresses. , a warm-season turfgrass with exceptional salt tolerance, can be irrigated with seawater. However, the gene family in seashore paspalum remains poorly understood.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!