Classifying breast cancer using multi-view graph neural network based on multi-omics data.

Front Genet

Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, China.

Published: February 2024

AI Article Synopsis

  • The paper discusses the importance of cancer grading and subtyping for predicting prognosis and treatment, highlighting a gap in current methods that often rely on single omics data.
  • It introduces a new multi-omics data fusion algorithm called MVGNN, which uses a graph convolutional network combined with an attention module to improve cancer classification and subtype prediction.
  • Experimental results show that the MVGNN model outperforms traditional machine learning approaches and single or dual omics data methods, indicating its effectiveness in integrating multiple data types for cancer classification.

Article Abstract

As the evaluation indices, cancer grading and subtyping have diverse clinical, pathological, and molecular characteristics with prognostic and therapeutic implications. Although researchers have begun to study cancer differentiation and subtype prediction, most of relevant methods are based on traditional machine learning and rely on single omics data. It is necessary to explore a deep learning algorithm that integrates multi-omics data to achieve classification prediction of cancer differentiation and subtypes. This paper proposes a multi-omics data fusion algorithm based on a multi-view graph neural network (MVGNN) for predicting cancer differentiation and subtype classification. The model framework consists of a graph convolutional network (GCN) module for learning features from different omics data and an attention module for integrating multi-omics data. Three different types of omics data are used. For each type of omics data, feature selection is performed using methods such as the chi-square test and minimum redundancy maximum relevance (mRMR). Weighted patient similarity networks are constructed based on the selected omics features, and GCN is trained using omics features and corresponding similarity networks. Finally, an attention module integrates different types of omics features and performs the final cancer classification prediction. To validate the cancer classification predictive performance of the MVGNN model, we conducted experimental comparisons with traditional machine learning models and currently popular methods based on integrating multi-omics data using 5-fold cross-validation. Additionally, we performed comparative experiments on cancer differentiation and its subtypes based on single omics data, two omics data, and three omics data. This paper proposed the MVGNN model and it performed well in cancer classification prediction based on multiple omics data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10912483PMC
http://dx.doi.org/10.3389/fgene.2024.1363896DOI Listing

Publication Analysis

Top Keywords

omics data
32
multi-omics data
20
cancer differentiation
16
data
13
classification prediction
12
omics features
12
cancer classification
12
omics
11
cancer
9
multi-view graph
8

Similar Publications

Fibrolamellar Hepatocellular Carcinoma (FLC) is a rare liver cancer characterized by a fusion oncokinase of the genes DNAJB1 and PRKACA, the catalytic subunit of protein kinase A (PKA). A few FLC-like tumors have been reported showing other alterations involving PKA. To better understand FLC pathogenesis and the relationships among FLC, FLC-like, and other liver tumors, we performed a massive multi-omics analysis.

View Article and Find Full Text PDF

Dual-stage optimizer for systematic overestimation adjustment applied to multi-objective genetic algorithms for biomarker selection.

Brief Bioinform

November 2024

School of Medicine, Institute of Biomedicine, University of Eastern Finland, Yliopistonranta 1, PO Box 1627, 70211 Kuopio, Finland.

The selection of biomarker panels in omics data, challenged by numerous molecular features and limited samples, often requires the use of machine learning methods paired with wrapper feature selection techniques, like genetic algorithms. They test various feature sets-potential biomarker solutions-to fine-tune a machine learning model's performance for supervised tasks, such as classifying cancer subtypes. This optimization process is undertaken using validation sets to evaluate and identify the most effective feature combinations.

View Article and Find Full Text PDF

Unlocking the future of complex human diseases prediction: multi-omics risk score breakthrough.

Front Bioinform

December 2024

Department of Immunology and Molecular Biology, College of Health Sciences, School of Biomedical Sciences, Makerere University, Kampala, Uganda.

View Article and Find Full Text PDF

Neurodegenerative diseases present complex genetic architectures, reflecting a continuum from monogenic to oligogenic and polygenic models. Recent advances in multi-omics data, coupled with systems genetics, have significantly refined our understanding of how these data impact neurodegenerative disease mechanisms. To contextualize these genetic discoveries, we provide a comprehensive critical overview of genetic architecture concepts, from Mendelian inheritance to the latest insights from oligogenic and omnigenic models.

View Article and Find Full Text PDF

Background: The mechanisms underlying the complex relationship between autoimmune hypothyroidism and neurological disorders remain unclear. We conducted a comprehensive analysis of associations between alternative splicing, transcriptomics, and proteomics data and autoimmune hypothyroidism.

Methods: Splicing-Wide association studies (SWAS), proteome-wide association studies (PWAS), and transcriptome-wide association studies (TWAS) were used to identify genes and proteins that regulate autoimmune hypothyroidism within the brain axis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!