Background: Identification of genomic patterns in tumors is an important problem, which would enable the community to understand and extend effective therapies across the current tissue-based tumor boundaries. With this in mind, in this work we develop a robust and fast algorithm to discover cancer driver genes using an unsupervised clustering of similarly expressed genes across cancer patients. Specifically, we introduce CaMoDi, a new method for module discovery which demonstrates superior performance across a number of computational and statistical metrics.

Results: The proposed algorithm CaMoDi demonstrates effective statistical performance compared to the state of the art, and is algorithmically simple and scalable - which makes it suitable for tissue-independent genomic characterization of individual tumors as well as groups of tumors. We perform an extensive comparative study between CaMoDi and two previously developed methods (CONEXIC and AMARETTO), across 11 individual tumors and 8 combinations of tumors from The Cancer Genome Atlas. We demonstrate that CaMoDi is able to discover modules with better average consistency and homogeneity, with similar or better adjusted R2 performance compared to CONEXIC and AMARETTO.

Conclusions: We present a novel method for Cancer Module Discovery, CaMoDi, and demonstrate through extensive simulations on the TCGA Pan-Cancer dataset that it achieves comparable or better performance than that of CONEXIC and AMARETTO, while achieving an order-of-magnitude improvement in computational run time compared to the other methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304219PMC
http://dx.doi.org/10.1186/1471-2164-15-S10-S8DOI Listing

Publication Analysis

Top Keywords

module discovery
12
camodi method
8
method cancer
8
cancer module
8
performance compared
8
individual tumors
8
conexic amaretto
8
camodi
6
cancer
5
tumors
5

Similar Publications

CICADA: a circRNA effort toward the ghost proteome.

Nucleic Acids Res

December 2024

Binzhou People's Hospital Affiliated to Shandong First Medical University/College of Medical Information and Artificial Intelligence, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, Shandong 250117, China.

Recent studies have confirmed that certain circRNAs encode proteins that are integral to various biological functions. In this study, we present CICADA, an algorithm specifically designed to assess the protein-coding potential and coding products of circRNAs at high throughput, which enables the identification of previously unknown circRNA-encoded proteins. By harnessing the potential of this algorithm, we identified a variety of functional, protein-coding circRNAs in esophageal squamous cell carcinoma and established circRNA translation profiles for diverse types of cancer.

View Article and Find Full Text PDF

Identification of hub genes, diagnostic model, and immune infiltration in preeclampsia by integrated bioinformatics analysis and machine learning.

BMC Pregnancy Childbirth

December 2024

Department of Gynecology, Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, 350001, China.

Purpose: This study aimed to identify novel biomarkers for preeclampsia (PE) diagnosis by integrating Weighted Gene Co-expression Network Analysis (WGCNA) with machine learning techniques.

Patients And Methods: We obtained the PE dataset GSE25906 from the gene expression omnibus (GEO) database. Analysis of differentially expressed genes (DEGs) and module genes with Limma and Weighted Gene Co-expression Network analysis (WGCNA).

View Article and Find Full Text PDF

BioStructNet: Structure-Based Network with Transfer Learning for Predicting Biocatalyst Functions.

J Chem Theory Comput

December 2024

School of Chemistry and Chemical Engineering, Queen's University Belfast, BT9 5AG Belfast, Northern Ireland, U.K.

Enzyme-substrate interactions are essential to both biological processes and industrial applications. Advanced machine learning techniques have significantly accelerated biocatalysis research, revolutionizing the prediction of biocatalytic activities and facilitating the discovery of novel biocatalysts. However, the limited availability of data for specific enzyme functions, such as conversion efficiency and stereoselectivity, presents challenges for prediction accuracy.

View Article and Find Full Text PDF

Deep representation learning of protein-protein interaction networks for enhanced pattern discovery.

Sci Adv

December 2024

Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA 94305, USA.

Protein-protein interaction (PPI) networks, where nodes represent proteins and edges depict myriad interactions among them, are fundamental to understanding the dynamics within biological systems. Despite their pivotal role in modern biology, reliably discerning patterns from these intertwined networks remains a substantial challenge. The essence of the challenge lies in holistically characterizing the relationships of each node with others in the network and effectively using this information for accurate pattern discovery.

View Article and Find Full Text PDF

MORE: a multi-omics data-driven hypergraph integration network for biomedical data classification and biomarker identification.

Brief Bioinform

November 2024

State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications, 9 Wenyuan, Nanjing 210023, China.

High-throughput sequencing methods have brought about a huge change in omics-based biomedical study. Integrating various omics data is possibly useful for identifying some correlations across data modalities, thus improving our understanding of the underlying biological mechanisms and complexity. Nevertheless, most existing graph-based feature extraction methods overlook the complementary information and correlations across modalities.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!