One of the major challenges in defining clinically relevant and less heterogeneous tumor subtypes is assigning biological and/or clinical interpretations to etiological (intrinsic) subtypes. Conventional clustering/subtyping approaches often fail to define such subtypes because they involve several discrete steps. Here we demonstrate a unique machine-learning method, phenotype mapping, which jointly integrates single omics data with phenotypic information, using three published breast cancer datasets (n = 2045). The framework uses a modified factor analysis method governed by a key assumption: features from different omics data types are correlated due to specific "hidden/mapping" variables, termed context-specific mapping variables (CMV). These variables can be modeled simultaneously with phenotypic data as covariates to yield functional subtypes and their associated features (e.g., genes) and phenotypes. In one example, we demonstrate the identification and validation of six novel "functional" (discrete) subtypes with differential responses to a cyclin-dependent kinase (CDK)4/6 inhibitor and etoposide by jointly integrating transcriptome profiles with four different drug response datasets from 37 breast cancer cell lines. These robust subtypes are also present in patient breast tumors with different prognoses. In another example, we modeled patient gene expression profiles and clinical covariates together to identify continuous subtypes with clinical/biological implications. Overall, this genome-phenome machine-learning integration tool identifies functional and phenotype-integrated discrete or continuous subtypes with clinical translational potential.
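The joint-modeling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' modified factor analysis: it assumes a plain truncated SVD of the standardized, concatenated omics and phenotype matrix as a PCA-style stand-in, and it runs on synthetic data. The function name `joint_latent_factors`, the matrix sizes other than the 37 samples and four drugs, and the argmax rule for discrete subtypes are all hypothetical choices made for illustration.

```python
import numpy as np

def joint_latent_factors(omics, phenotypes, n_factors):
    """Estimate shared latent factors from the joint [omics | phenotypes] matrix.

    Assumption: a truncated SVD is used as a simple stand-in for a
    factor-analysis fit; it is not the published method.
    """
    joint = np.hstack([omics, phenotypes])
    # Standardize each column so genes and phenotypes share a scale.
    joint = (joint - joint.mean(axis=0)) / joint.std(axis=0)
    u, s, vt = np.linalg.svd(joint, full_matrices=False)
    scores = u[:, :n_factors] * s[:n_factors]  # samples x factors
    loadings = vt[:n_factors].T                # features x factors
    return scores, loadings

# Synthetic data (hypothetical): hidden variables drive both data types.
rng = np.random.default_rng(0)
n_samples, n_genes, n_drugs, n_factors = 37, 50, 4, 3
hidden = rng.normal(size=(n_samples, n_factors))
omics = hidden @ rng.normal(size=(n_factors, n_genes))
omics += 0.1 * rng.normal(size=(n_samples, n_genes))
phenotypes = hidden @ rng.normal(size=(n_factors, n_drugs))
phenotypes += 0.1 * rng.normal(size=(n_samples, n_drugs))

scores, loadings = joint_latent_factors(omics, phenotypes, n_factors)

# Continuous subtypes: the factor scores themselves.
# Discrete subtypes (illustrative rule): the factor with the largest |score|.
discrete_subtype = np.abs(scores).argmax(axis=1)

# Loadings split back into gene-side and phenotype-side associations.
gene_loadings = loadings[:n_genes]
drug_loadings = loadings[n_genes:]
```

Because genes and phenotypes load on the same factors, each factor can be read from both sides: `gene_loadings` suggests which features define it, and `drug_loadings` suggests which responses track it.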

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601761
DOI: http://dx.doi.org/10.3390/cancers12102811

Similar Publications

scMMAE: masked cross-attention network for single-cell multimodal omics fusion to enhance unimodal omics.

Brief Bioinform

November 2024

Guangdong Provincial Key Laboratory of Mathematical and Neural Dynamical Systems, Great Bay University, No. 16 Daxue Rd, Songshanhu District, Dongguan, Guangdong, 523000, China.

Multimodal omics provide deeper insight into the biological processes and cellular functions, especially transcriptomics and proteomics. Computational methods have been proposed for the integration of single-cell multimodal omics of transcriptomics and proteomics. However, existing methods primarily concentrate on the alignment of different omics, overlooking the unique information inherent in each omics type.

Molecular characterization of tumors is essential to identify predictive biomarkers that inform treatment decisions and improve precision immunotherapy development and administration. However, challenges such as the heterogeneity of tumors and patient responses, limited efficacy of current biomarkers, and the predominant reliance on single-omics data, have hindered advances in accurately predicting treatment outcomes. Standard therapy generally applies a "one size fits all" approach, which not only provides ineffective or limited responses, but also an increased risk of off-target toxicities and acceleration of resistance mechanisms or adverse effects.

Background: An accurate diagnosis of septic versus reactive or autoimmune arthritis remains clinically challenging. A multi-omics strategy comprising metagenomic and proteomic technologies was undertaken for children diagnosed with presumed septic arthritis to advance clinical diagnoses and care for affected individuals.

Methods: Twelve children with suspected septic arthritis were prospectively enrolled to compare standard of care tests with a rapid multi-omics approach.

Background: There is increasing need for effective incorporation of high-dimensional genetics data from individuals with varied ancestry in genome-wide association (GWAS) analyses. Classically, multi-ancestry GWAS analyses are performed using statistical meta-analysis to combine results conducted within homogeneous ancestry groups. The emergence of cosmopolitan reference panels makes collective preprocessing of GWAS data possible, but impact on downstream GWAS results in a mega-analysis framework merits investigation.

Multimodal integration using a machine learning approach facilitates risk stratification in HR+/HER2- breast cancer.

Cell Rep Med

January 2025

Key Laboratory of Breast Cancer in Shanghai, Department of Breast Surgery, Fudan University Shanghai Cancer Center, Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, P.R. China.

Hormone receptor-positive (HR+)/human epidermal growth factor receptor 2-negative (HER2-) breast cancer is the most common type of breast cancer, with continuous recurrence remaining an important clinical issue. Current relapse predictive models in HR+/HER2- breast cancer patients still have limitations. The integration of multidimensional data represents a promising alternative for predicting relapse.
