Traveling on discrete embeddings of gene expression.

Artif Intell Med

Microsoft Research, One Microsoft Way, 98052 Redmond, WA, USA.

Published: June 2016

Objective: High-throughput technologies have generated an unprecedented amount of high-dimensional gene expression data. Algorithmic approaches could be extremely useful to distill information and derive compact interpretable representations of the statistical patterns present in the data. This paper proposes a mining approach to extract an informative representation of gene expression profiles based on a generative model called the Counting Grid (CG).

Method: Using the CG model, gene expression values are arranged on a discrete grid, learned so that "similar" co-expression patterns lie in close proximity, which yields an intuitive visualization of the dataset. In addition, the model makes it possible to identify the genes that distinguish between classes (e.g. different types of cancer). Finally, each sample can be characterized by a discriminative signature, extracted from the model, that can be effectively employed for classification.
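The grid arrangement described in the Method can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the standard Counting Grid formulation, in which each grid cell holds a distribution over features (here, genes), a sample is explained by the average of those distributions inside a wrap-around window, and each sample is a nonnegative vector treated as counts. The function names, the window size, and the toy grid are hypothetical, and the learning step (fitting the grid to data) is omitted.

```python
import numpy as np

def window_distributions(pi, window):
    """Average per-cell feature distributions over a wrap-around window.

    pi: array of shape (E1, E2, Z); each pi[i, j] sums to 1 over Z features.
    window: (W1, W2) window size (an assumed hyperparameter).
    Returns h of shape (E1, E2, Z), where h[i, j] is the mean of pi over
    the window whose top-left corner is cell (i, j).
    """
    w1, w2 = window
    h = np.zeros(pi.shape, dtype=float)
    for di in range(w1):
        for dj in range(w2):
            # A negative shift moves cell (i + di, j + dj) to position (i, j),
            # wrapping around the grid edges.
            h += np.roll(pi, shift=(-di, -dj), axis=(0, 1))
    return h / (w1 * w2)

def map_sample(counts, h, eps=1e-12):
    """Return the grid location with the highest log-likelihood for a sample.

    counts: nonnegative vector of length Z (expression values used as counts).
    """
    # Log-likelihood (up to a constant): sum_z counts[z] * log h[i, j, z]
    loglik = np.tensordot(np.log(h + eps), counts, axes=([2], [0]))
    return np.unravel_index(np.argmax(loglik), loglik.shape)

# Toy grid: uniform everywhere except one cell that strongly prefers feature 0.
pi = np.full((4, 4, 3), 1.0 / 3.0)
pi[2, 3] = [0.98, 0.01, 0.01]

h_single = window_distributions(pi, (1, 1))   # window of one cell: h equals pi
h_smooth = window_distributions(pi, (2, 2))   # neighbouring windows share cells
location = map_sample(np.array([10.0, 0.0, 0.0]), h_single)
```

Because neighbouring windows overlap, samples mapped to nearby locations share most of their underlying cells, which is what places similar expression patterns close together on the grid.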

Results: A thorough evaluation on several gene expression datasets demonstrates the suitability of the proposed approach from two perspectives: numerically, we reached state-of-the-art classification accuracies on 5 of 7 datasets, and obtained similar results when the approach was tested in a gene selection setting (with a stability always above 0.87); clinically, many of the genes highlighted by the model as significant were confirmed to also play a key role in cancer biology.

Conclusion: The proposed framework can be successfully exploited to meaningfully visualize the samples, detect medically relevant genes, and properly classify samples.

Source
http://dx.doi.org/10.1016/j.artmed.2016.05.002

Similar Publications

Pulmonary hypertension (PH) increases the mortality of preterm infants with bronchopulmonary dysplasia (BPD). There are no curative therapies for this disease. Lung endothelial carnitine palmitoyltransferase 1a (Cpt1a), the rate-limiting enzyme of the carnitine shuttle system, is reduced in a rodent model of BPD.

Protocol to generate a 3D atherogenesis-on-chip model for studying endothelial-macrophage crosstalk in atherogenesis.

STAR Protoc

January 2025

Department of Experimental Vascular Medicine, Amsterdam UMC, location AMC, Meibergdreef 9, Amsterdam, the Netherlands; Amsterdam Cardiovascular Sciences, Atherosclerosis & Ischemic Syndromes, Amsterdam, the Netherlands; Laboratory of Angiogenesis and Vascular Metabolism, VIB-KU Leuven Center for Cancer Biology, VIB, 3000 Leuven, Belgium; Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven and Leuven Cancer Institute (LKI), 3000 Leuven, Belgium.

The endothelium is the gatekeeper of vessel health, and its dysfunction is pivotal in driving atherogenesis. Here, we present a protocol to replicate endothelial-macrophage crosstalk during atherogenesis, called the "atherogenesis-on-chip" model, based on the Emulate dual-channel perfusion system. We describe a model for studying endothelial-macrophage interactions during atherogenesis in human aortic endothelial cells and human macrophages using qPCR and secretome analysis, fluorescence microscopy, and flow cytometry.

Angiogenesis begins as endothelial cells migrate, forming a sprouting tip and subsequent growth-rich stalk cells. Here, we present a protocol for transcriptomic and epigenomic analyses of tip-like cells in cultured endothelial cells. We describe steps for stimulating human umbilical vein endothelial cells (HUVECs) with vascular endothelial cell growth factor (VEGF) to generate tip-like cells.

Cadmium (Cd) is a toxic heavy metal that induces vascular disorders. Previous studies suggest that Cd in the bloodstream affects vascular endothelial cells (ECs), potentially contributing to vascular-related diseases. However, the molecular mechanisms underlying the effects of Cd on ECs remain poorly understood.

In the present study, we identified 22 significant SNPs, eight stable QTLs and 17 potential candidate genes associated with 100-seed weight in soybean. Soybean is an economically important crop that is rich in seed oil and protein. The 100-seed weight (HSW) is a crucial yield contributing trait.
