AI Article Synopsis

  • Researchers have developed a new computational method to identify candidate genes for diseases lacking prior knowledge, improving gene prioritization using experimental data on gene expression.
  • The study applied various machine learning techniques, including network-based approaches, to better score disease candidate genes by assessing their relationships with differentially expressed genes.
  • Results indicated that the new methods significantly outperformed traditional ranking approaches, with the Heat Kernel Diffusion Ranking achieving the highest accuracy and lowering error rates in gene ranking.

Article Abstract

Background: Discovering novel disease genes is still challenging for diseases for which no prior knowledge--such as known disease genes or disease-related pathways--is available. Performing genetic studies frequently results in large lists of candidate genes of which only few can be followed up for further investigation. We have recently developed a computational method for constitutional genetic disorders that identifies the most promising candidate genes by replacing prior knowledge by experimental data of differential gene expression between affected and healthy individuals.To improve the performance of our prioritization strategy, we have extended our previous work by applying different machine learning approaches that identify promising candidate genes by determining whether a gene is surrounded by highly differentially expressed genes in a functional association or protein-protein interaction network.

Results: We have proposed three strategies scoring disease candidate genes relying on network-based machine learning approaches, such as kernel ridge regression, heat kernel, and Arnoldi kernel approximation. For comparison purposes, a local measure based on the expression of the direct neighbors is also computed. We have benchmarked these strategies on 40 publicly available knockout experiments in mice, and performance was assessed against results obtained using a standard procedure in genetics that ranks candidate genes based solely on their differential expression levels (Simple Expression Ranking). Our results showed that our four strategies could outperform this standard procedure and that the best results were obtained using the Heat Kernel Diffusion Ranking leading to an average ranking position of 8 out of 100 genes, an AUC value of 92.3% and an error reduction of 52.8% relative to the standard procedure approach which ranked the knockout gene on average at position 17 with an AUC value of 83.7%.

Conclusion: In this study we could identify promising candidate genes using network based machine learning approaches even if no knowledge is available about the disease or phenotype.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2945940PMC
http://dx.doi.org/10.1186/1471-2105-11-460DOI Listing

Publication Analysis

Top Keywords

candidate genes
24
machine learning
16
learning approaches
16
promising candidate
12
standard procedure
12
genes
10
differential expression
8
disease genes
8
identify promising
8
heat kernel
8

Similar Publications

The central nervous system is affected by multiple sclerosis (MS), a chronic autoimmune illness characterized by axonal destruction, demyelination, and inflammation. This article summarizes the state of the field, highlighting its complexity and significant influence on people's quality of life. The research employs a network pharmacological approach, integrating systems biology, bioinformatics, and pharmacology to identify biomarkers associated with MS.

View Article and Find Full Text PDF

Genetic landscape in undiagnosed patients with syndromic hearing loss revealed by whole exome sequencing and phenotype similarity search.

Hum Genet

January 2025

Division of Hearing and Balance Research, National Institute of Sensory Organs, NHO Tokyo Medical Center, 2-5-1 Higashigaoka, Meguro-Ku, Tokyo, 152-8902, Japan.

There are hundreds of rare syndromic diseases involving hearing loss, many of which are not targeted for clinical genetic testing. We systematically explored the genetic causes of undiagnosed syndromic hearing loss using a combination of whole exome sequencing (WES) and a phenotype similarity search system called PubCaseFinder. Fifty-five families with syndromic hearing loss of unknown cause were analyzed using WES after prescreening of several deafness genes depending on patient clinical features.

View Article and Find Full Text PDF

Metabolic reprogramming induced by PSMA4 overexpression facilitates bortezomib resistance in multiple myeloma.

Ann Hematol

January 2025

Department of Hematology, Navy Medical Center of PLA, Naval Medical University, No. 338 West Huaihai Road, Changning District, Shanghai, 200052, China.

Multiple myeloma(MM) remains incurable with high relapse and chemoresistance rates. Differentially expressed genes(DEGs) between newly diagnosed myeloma and secondary plasma cell leukemia(sPCL) were subjected to a weighted gene co-expression network analysis(WGCNA). Drug resistant myeloma cell lines were established.

View Article and Find Full Text PDF

Metabotropic glutamate (mGlu) receptors are candidate drug targets for therapeutic intervention in Parkinson's disease (PD). Here we focused on mGlu3, a receptor subtype involved in synaptic regulation and neuroinflammation. mGlu3 mice showed an enhanced nigro-striatal damage and microglial activation in response to 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP).

View Article and Find Full Text PDF

Rapid and accurate multi-phenotype imputation for millions of individuals.

Nat Commun

January 2025

Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs & Fisheries college, Jimei University, Xiamen, Fujian, People's Republic of China.

Deep phenotyping can enhance the power of genetic analysis, including genome-wide association studies (GWAS), but the occurrence of missing phenotypes compromises the potential of such resources. Although many phenotypic imputation methods have been developed, the accurate imputation of millions of individuals remains challenging. In the present study, we have developed a multi-phenotype imputation method based on mixed fast random forest (PIXANT) by leveraging efficient machine learning (ML)-based algorithms.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!