DeepHE: Accurately predicting human essential genes based on deep learning.

PLoS Comput Biol

Boston Latin School, Boston, Massachusetts, United States of America.

Published: September 2020

Accurately predicting essential genes using computational methods can greatly reduce the effort in finding them via wet experiments at both time and resource scales, and further accelerate the process of drug discovery. Several computational methods have been proposed for predicting essential genes in model organisms by integrating multiple biological data sources either via centrality measures or machine learning based methods. However, the methods aiming to predict human essential genes are still limited and the performance still need improve. In addition, most of the machine learning based essential gene prediction methods are lack of skills to handle the imbalanced learning issue inherent in the essential gene prediction problem, which might be one factor affecting their performance. We propose a deep learning based method, DeepHE, to predict human essential genes by integrating features derived from sequence data and protein-protein interaction (PPI) network. A deep learning based network embedding method is utilized to automatically learn features from PPI network. In addition, 89 sequence features were derived from DNA sequence and protein sequence for each gene. These two types of features are integrated to train a multilayer neural network. A cost-sensitive technique is used to address the imbalanced learning problem when training the deep neural network. The experimental results for predicting human essential genes show that our proposed method, DeepHE, can accurately predict human gene essentiality with an average performance of AUC higher than 94%, the area under precision-recall curve (AP) higher than 90%, and the accuracy higher than 90%. We also compare DeepHE with several widely used traditional machine learning models (SVM, Naïve Bayes, Random Forest, and Adaboost) using the same features and utilizing the same cost-sensitive technique to against the imbalanced learning issue. The experimental results show that DeepHE significantly outperforms the compared machine learning models. We have demonstrated that human essential genes can be accurately predicted by designing effective machine learning algorithm and integrating representative features captured from available biological data. The proposed deep learning framework is effective for such task.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7521708PMC
http://dx.doi.org/10.1371/journal.pcbi.1008229DOI Listing

Publication Analysis

Top Keywords

essential genes
28
human essential
20
machine learning
20
deep learning
16
learning based
16
learning
12
predict human
12
imbalanced learning
12
essential
9
deephe accurately
8

Similar Publications

Melanoma is an aggressive type of skin cancer that arises from melanocytes, the cells responsible for producing skin pigment. In contrast to non-melanoma skin cancers like basal cell carcinoma and squamous cell carcinoma, melanoma is more invasive. Melanoma was distinguished by its rapid progression, high metastatic potential, and significant resistance to conventional therapies.

View Article and Find Full Text PDF

Thiamin, an essential micronutrient, is a cofactor for enzymes involved in the central carbon metabolism and amino acids pathways. Despite efforts to enhance thiamin content in rice by incorporating thiamin biosynthetic genes, increasing thiamin content in endosperm remains challenging, possibly due to a lack of thiamin stability and/or a local sink. The introduction of storage proteins has been successful in biofortification strategies and similar efforts targeting thiamin led to a 3-4-fold increase in white rice.

View Article and Find Full Text PDF

Genomic and Methylomic Signatures Associated With the Maintenance of Genome Stability and Adaptive Evolution in Two Closely Allied Wolf Spiders.

Mol Ecol Resour

January 2025

Key Laboratory of Eco-Environments in Three Gorges Reservoir Region (Ministry of Education), School of Life Sciences, Southwest University, Chongqing, China.

Pardosa spiders, belonging to the wolf spider family Lycosidae, play a vital role in maintaining the health of forest and agricultural ecosystems due to their function in pest control. This study presents chromosome-level genome assemblies for two allied Pardosa species, P. laura and P.

View Article and Find Full Text PDF

Baseline gene expression in BALB/c and C57BL/6 peritoneal macrophages influences but does not dictate their functional phenotypes.

Exp Biol Med (Maywood)

January 2025

Centro de Biología Celular y Molecular de Enfermedades, Instituto de Investigaciones Científicas y Servicios de Alta Tecnología (INDICASAT AIP), Panama City, Panama.

Macrophages are effector cells of the immune system and essential modulators of immune responses. Different functional phenotypes of macrophages with specific roles in the response to stimuli have been described. The C57BL/6 and BALB/c mouse strains tend to selectively display distinct macrophage activation states in response to pathogens, namely, the M1 and M2 phenotypes, respectively.

View Article and Find Full Text PDF

Background: Previous studies reported significant relationships between obesity and pulmonary dysfunction. Here, we investigated genetic alterations in the lung tissues of high fat diet (HFD) induced obese mouse through transcriptomic and molecular analyses.

Methods: Eight-week-old male C57BL/6J mice were fed either a normal chow diet (NCD) or HFD for 12 weeks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!