Consensus clustering has been widely used in bioinformatics and other applications to improve the accuracy, stability and reliability of clustering results. This approach ensembles cluster co-occurrences from multiple clustering runs on subsampled observations. For application to large-scale bioinformatics data, such as to discover cell types from single-cell sequencing data, for example, consensus clustering has two significant drawbacks: (i) computational inefficiency due to repeatedly applying clustering algorithms, and (ii) lack of interpretability into the important features for differentiating clusters. In this paper, we address these two challenges by developing IMPACC: Interpretable MiniPatch Adaptive Consensus Clustering. Our approach adopts three major innovations. We ensemble cluster co-occurrences from tiny subsets of both observations and features, termed minipatches, thus dramatically reducing computation time. Additionally, we develop adaptive sampling schemes for observations, which result in both improved reliability and computational savings, as well as adaptive sampling schemes of features, which lead to interpretable solutions by quickly learning the most relevant features that differentiate clusters. We study our approach on synthetic data and a variety of real large-scale bioinformatics data sets; results show that our approach not only yields more accurate and interpretable cluster solutions, but it also substantially improves computational efficiency compared to standard consensus clustering approaches.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9560608 | PMC |
http://dx.doi.org/10.1371/journal.pcbi.1010577 | DOI Listing |
Methods
January 2025
Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming, 650504, Yunnan, China. Electronic address:
Cancer is a complex and heterogeneous disease, and accurate cancer subtyping can significantly improve patient survival rates. The complexity of cancer spans multiple omics levels, and analyzing multi-omics data for cancer subtyping is a focus of considerable attention. However, extracting complementary information from different omics data sources and adaptively integrating them remains a major challenge.
View Article and Find Full Text PDFDiscov Oncol
January 2025
Department of Laboratory, the Second Hospital of Shanxi Medical University, No. 382, Wuyi Road, Taiyuan, 030001, Shanxi, People's Republic of China.
Background: Pancreatic cancer (PAC) has a complex tumor immune microenvironment, and currently, there is a lack of accurate personalized treatment. Establishing a novel consensus machine learning driven signature (CMLS) that offers a unique predictive model and possible treatment targets for this condition was the goal of this study.
Methods: This study integrated multiple omics data of PAC patients, applied ten clustering techniques and ten machine learning approaches to construct molecular subtypes for PAC, and created a new CMLS.
Front Immunol
January 2025
Department of Gynecology, Handan Central Hospital, Handan, China.
Background: Ferroptosis, a recently discovered iron-dependent cell death, is linked to various diseases but its role in endometriosis is still not fully understood.
Methods: In this study, we integrated microarray data of endometriosis from the GEO database and ferroptosis-related genes (FRGs) from the FerrDb database to further investigate the regulation of ferroptosis in endometriosis and its impact on the immune microenvironment. WGCNA identified ferroptosis-related modules, annotated by GO & KEGG.
Front Immunol
January 2025
Department of Thyroid & Breast Surgery, The First People's Hospital of Xiaoshan District, Xiaoshan Affiliated Hospital of Wenzhou Medical University, Hangzhou, Zhejiang, China.
Introduction: While most thyroid cancer patients have a favorable prognosis, anaplastic thyroid carcinoma (ATC) remains a particularly aggressive form with a median survival time of just five months. Conventional therapies offer limited benefits for this type of thyroid cancer. Our study aims to identify ATC patients who might bene t from immunotherapy.
View Article and Find Full Text PDFJ Transl Med
January 2025
School of Nursing, Nanjing Medical University, Nanjing, 211166, China.
Background: Tumor-associated fibrosis modifies the tumor microenvironment (TME), hinders the infiltration and activity of cytotoxic immune cells, and is a critical pathological process leading to the ineffectiveness of tumor immunotherapy in gastric cancer (GC). However, the specific mechanisms and interventions are yet to be fully explored.
Methods: Our study included 375 gastric cancer samples from TCGA, 1 single-cell RNA sequencing (scRNA-seq) dataset comprising of 15 gastric cancer samples from GEO, 19 cohorts of immunotherapy and 2 GWAS datasets.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!