Heterogeneous Multi-Layered Network Model for Omics Data Integration and Analysis.

Front Genet

Ph.D. Program in Computer Science, The City University of New York, New York, NY, United States.

Published: January 2020

Advances in next-generation sequencing and high-throughput techniques have enabled the generation of vast amounts of diverse omics data. These big data provide an unprecedented opportunity in biology, but impose great challenges in data integration, data mining, and knowledge discovery due to the complexity, heterogeneity, dynamics, uncertainty, and high-dimensionality inherited in the omics data. Network has been widely used to represent relations between entities in biological system, such as protein-protein interaction, gene regulation, and brain connectivity (i.e. network construction) as well as to infer novel relations given a reconstructed network (aka link prediction). Particularly, heterogeneous multi-layered network (HMLN) has proven successful in integrating diverse biological data for the representation of the hierarchy of biological system. The HMLN provides unparalleled opportunities but imposes new computational challenges on establishing causal genotype-phenotype associations and understanding environmental impact on organisms. In this review, we focus on the recent advances in developing novel computational methods for the inference of novel biological relations from the HMLN. We first discuss the properties of biological HMLN. Then we survey four categories of state-of-the-art methods (matrix factorization, random walk, knowledge graph, and deep learning). Thirdly, we demonstrate their applications to omics data integration and analysis. Finally, we outline strategies for future directions in the development of new HMLN models.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6997577PMC
http://dx.doi.org/10.3389/fgene.2019.01381DOI Listing

Publication Analysis

Top Keywords

omics data
16
data integration
12
heterogeneous multi-layered
8
multi-layered network
8
data
8
integration analysis
8
biological system
8
network
5
biological
5
hmln
5

Similar Publications

Understanding the molecular landscape of nonmuscle-invasive bladder cancer (NMIBC) is essential to improve risk assessment and treatment regimens. We performed a comprehensive genomic analysis of patients with NMIBC using whole-exome sequencing (n = 438), shallow whole-genome sequencing (n = 362) and total RNA sequencing (n = 414). A large genomic variation within NMIBC was observed and correlated with different molecular subtypes.

View Article and Find Full Text PDF

Spatially resolved omics (SRO) technologies enable the identification of cell types while preserving their organization within tissues. Application of such technologies offers the opportunity to delineate cell-type spatial relationships, particularly across different length scales, and enhance our understanding of tissue organization and function. To quantify such multi-scale cell-type spatial relationships, we present CRAWDAD, Cell-type Relationship Analysis Workflow Done Across Distances, as an open-source R package.

View Article and Find Full Text PDF

The extensive application of graphene nanosheets (GNSs) has raised concerns over risks to sensitive species in the aquatic environment. The humic acid (HA) corona is traditionally considered to reduce GNSs toxicity. Here, we evaluate the effect of sorbed HA (GNSs-HA) on the toxicity of GNSs to Gram positive Bacillus tropicus.

View Article and Find Full Text PDF

Glioma is characterized by high heterogeneity and poor prognosis. Attempts have been made to understand its diversity in both genetic expressions and radiomic characteristics, while few integrated the two omics in predicting survival of glioma. This study was intended to investigate the connection between glioma imaging and genome, and examine its predictive value in glioma mortality risk and tumor immune microenvironment (TIME).

View Article and Find Full Text PDF

Background: The recent European-ancestry based genome-wide association study (GWAS) of Alzheimer disease (AD) by Bellenguez2022 has identified 75 significant genetic loci, but only a few have been functionally mapped to effector gene level. Besides the large-scale RNA expression, protein and metabolite levels are key molecular traits bridging the genetic variants to AD risk, and thus we decided to integrate them into the genetic analysis to pinpoint key proteins and metabolites underlying AD etiology. Few studies have generated more than one layer of post-transcriptional phenotypes, limiting the scale of biological translation of disease modifying treatments.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!