Meta-analytic support vector machine for integrating multiple omics data.

BioData Min

Department of Statistics, Korea University, Anam-dong, Seoul, 136-701 South Korea.

Published: January 2017

Background: High-throughput microarray and sequencing data are now widely used to monitor biomarkers and biological processes related to many diseases. In this setting, the support vector machine (SVM) has been a popular and successful tool for gene selection in many applications. Despite the advantages of SVMs, analysis of a single small- or mid-sized data set inevitably suffers from low reproducibility and low statistical power. To address this problem, we propose a meta-analytic support vector machine (Meta-SVM) that can accommodate multiple omics data, making it possible to detect consensus genes associated with diseases across studies.
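The abstract does not state the Meta-SVM objective or its optimizer, so the following is only a minimal sketch of the general idea of selecting consensus genes across studies, not the authors' method. It assumes a linear SVM per study with hinge loss, a group-lasso penalty that ties each gene's coefficients together across studies (a stand-in penalty chosen for illustration), proximal subgradient descent, centered features with no intercept, and simulated data. All function names and parameter values are hypothetical.

```python
import numpy as np

def fit_joint_sparse_svm(studies, lam=0.5, step=0.1, n_iter=300):
    """Illustrative joint fit: one linear SVM per study, coupled by a
    group-lasso penalty over each gene's coefficients across studies.
    (Assumed stand-in; the abstract does not specify the penalty.)"""
    n_studies = len(studies)
    n_genes = studies[0][0].shape[1]
    W = np.zeros((n_studies, n_genes))  # row k = coefficients for study k
    for _ in range(n_iter):
        G = np.zeros_like(W)
        for k, (X, y) in enumerate(studies):
            # Samples inside the margin contribute to the hinge subgradient.
            active = (y * (X @ W[k])) < 1.0
            G[k] = -(X * (y * active)[:, None]).mean(axis=0)
        V = W - step * G
        # Group soft-thresholding: shrink each gene's column as a whole,
        # so a gene is kept or dropped in all studies together.
        norms = np.linalg.norm(V, axis=0)
        shrink = np.maximum(0.0, 1.0 - step * lam / np.maximum(norms, 1e-12))
        W = V * shrink
    return W

def consensus_genes(W):
    """Indices of genes with a nonzero coefficient group across studies."""
    return np.flatnonzero(np.linalg.norm(W, axis=0) > 0)

def simulate_study(rng, n=200, p=20, n_signal=3, shift=1.5):
    """Toy study: the first n_signal genes are shifted by class label."""
    y = rng.choice([-1.0, 1.0], size=n)
    X = rng.standard_normal((n, p))
    X[:, :n_signal] += shift * y[:, None]
    return X, y

rng = np.random.default_rng(0)
studies = [simulate_study(rng) for _ in range(3)]
W = fit_joint_sparse_svm(studies)
selected = consensus_genes(W)
print("consensus genes:", selected.tolist())
```

Because the penalty acts on a gene's coefficients from all studies at once, a gene is either retained in every study or removed from every study, which is one simple way to express "consensus" selection. Larger values of `lam` select fewer genes.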

Results: Experimental studies show that the Meta-SVM is superior to the existing meta-analysis method in detecting true signal genes. In real data applications, the method was applied to diverse omics data of breast cancer (TCGA) and to mRNA expression data of lung disease (idiopathic pulmonary fibrosis; IPF), identifying gene sets consistently associated with the diseases across studies. In particular, the gene set identified from the TCGA omics data was significantly enriched in the ABC transporter pathways, which are well known to be critical to the breast cancer mechanism.

Conclusion: By jointly leveraging multiple omics data, the Meta-SVM effectively achieves the purpose of meta-analysis and facilitates the identification of potential biomarkers and the elucidation of the disease process.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5270233
DOI: http://dx.doi.org/10.1186/s13040-017-0126-8

Similar Publications

Clear cell renal cell carcinoma (ccRCC) is a highly malignant tumor characterized by a significant propensity for recurrence and metastasis. DNA methylation has emerged as a critical epigenetic mechanism with substantial utility in cancer diagnosis. In this study, multi-omics data were utilized to investigate the target genes regulated by the transcription factor MYC-associated zinc finger protein (MAZ) in ccRCC, leading to the identification of thymidine phosphorylase (TYMP) as a gene with notably elevated expression in ccRCC.

The HoloFood project used a hologenomic approach to understand the impact of host-microbiota interactions on salmon and chicken production by analysing multiomic data, phenotypic characteristics, and associated metadata in response to novel feeds. The project's raw data, derived analyses, and metadata are deposited in public, open archives (BioSamples, European Nucleotide Archive, MetaboLights, and MGnify), so making use of these diverse data types may require access to multiple resources. This is especially complex where analysis pipelines produce derived outputs such as functional profiles or genome catalogues.

Transcriptomics and Proteomics Analysis of the Liver of RAD52 Knockout Mice.

Int J Mol Sci

January 2025

State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China.

RAD52 plays crucial roles in several processes in mammalian cells, including DNA double-strand break repair, viral infection, cancer development, and antibody class switching. To comprehensively elucidate the role of RAD52 in maintaining genome stability and uncover additional functions of RAD52 in mammals, we performed transcriptomics and proteomics analysis of the liver of RAD52 knockout mice. Transcriptomics analysis reveals overexpression of mitochondrial genes in the liver of RAD52 knockout (RAD52KO) mice.

Neuroblastoma is a common malignant tumor in childhood that seriously endangers the health and lives of children, making it essential to find effective prognostic markers to accurately predict their clinical outcomes. The development of high-throughput technology in the biomedical field has made it possible to obtain multi-omics data, whose integration can compensate for missing or unreliable information in a single data source. In this study, we integrated clinical data and two omics data, i.

Hepatocellular carcinoma (HCC) is a highly heterogeneous cancer with a poor prognosis. During the development of cancer cells, mitochondria influence various cell death patterns by regulating metabolic pathways such as oxidative phosphorylation. However, the relationship between mitochondrial function and cell death patterns in HCC remains unclear.
