Unsupervised deep learning reveals prognostically relevant subtypes of glioblastoma.

BMC Bioinformatics

Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Blvd, Pittsburgh, PA, 15206, USA.

Published: October 2017

Background: One approach to improving the personalized treatment of cancer is to understand the cellular signaling transduction pathways that cause cancer at the level of the individual patient. In this study, we used unsupervised deep learning to learn the hierarchical structure within cancer gene expression data. Deep learning is a group of machine learning algorithms that use multiple layers of hidden units to capture hierarchically related, alternative representations of the input data. We hypothesize that this hierarchical structure learned by deep learning will be related to the cellular signaling system.

Results: Robust deep learning model selection identified a network architecture that is biologically plausible. Our model selection results indicated that the 1st hidden layer of our deep learning model should contain about 1300 hidden units to most effectively capture the covariance structure of the input data. This agrees with the estimated number of human transcription factors, which is approximately 1400. This result lends support to our hypothesis that the 1st hidden layer of a deep learning model trained on gene expression data may represent signals related to transcription factor activation. Using the 3rd hidden layer representation of each tumor as learned by our unsupervised deep learning model, we performed consensus clustering on all tumor samples-leading to the discovery of clusters of glioblastoma multiforme with differential survival. One of these clusters contained all of the glioblastoma samples with G-CIMP, a known methylation phenotype driven by the IDH1 mutation and associated with favorable prognosis, suggesting that the hidden units in the 3rd hidden layer representations captured a methylation signal without explicitly using methylation data as input. We also found differentially expressed genes and well-known mutations (NF1, IDH1, EGFR) that were uniquely correlated with each of these clusters. Exploring these unique genes and mutations will allow us to further investigate the disease mechanisms underlying each of these clusters.

Conclusions: In summary, we show that a deep learning model can be trained to represent biologically and clinically meaningful abstractions of cancer gene expression data. Understanding what additional relationships these hidden layer abstractions have with the cancer cellular signaling system could have a significant impact on the understanding and treatment of cancer.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5629551PMC
http://dx.doi.org/10.1186/s12859-017-1798-2DOI Listing

Publication Analysis

Top Keywords

deep learning
36
learning model
20
hidden layer
20
unsupervised deep
12
cellular signaling
12
gene expression
12
expression data
12
hidden units
12
learning
10
treatment cancer
8

Similar Publications

This dataset contains demographic, morphological and pathological data, endoscopic images and videos of 191 patients with colorectal polyps. Morphological data is included based on the latest international gastroenterology classification references such as Paris, Pit and JNET classification. Pathological data includes the diagnosis of the polyps including Tubular, Villous, Tubulovillous, Hyperplastic, Serrated, Inflammatory and Adenocarcinoma with Dysplasia Grade & Differentiation.

View Article and Find Full Text PDF

Long non-coding RNAs (lncRNAs) play crucial roles in numerous biological processes and are involved in complex human diseases through interactions with proteins. Accurate identification of lncRNA-protein interactions (LPI) can help elucidate the functional mechanisms of lncRNAs and provide scientific insights into the molecular mechanisms underlying related diseases. While many sequence-based methods have been developed to predict LPIs, efficiently extracting and effectively integrating potential feature information that reflects functional attributes from lncRNA and protein sequences remains a significant challenge.

View Article and Find Full Text PDF

Multi-Energy Evaluation of Image Quality in Spectral CT Pulmonary Angiography Using Different Strength Deep Learning Spectral Reconstructions.

Acad Radiol

December 2024

Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL (M.H-S., H.S.S., A.G.R., S.E.M., J.C.P., E.Y.A., B.H., R.F.); Department of Radiology, University of Florida College of Medicine, Gainesville, FL (M.H-S., H.S.S., A.G.R., J.C.P., E.Y.A., B.H., R.F.); Division of Medical Physics, University of Florida College of Medicine, Gainesville, FL (R.F.); Department of Neurology, Division of Movement Disorders, University of Florida College of Medicine, Gainesville, FL (R.F.); Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, Quebec, Canada (R.F.); Department of Radiology, AdventHealth Medical Group, Maitland, FL (R.F.). Electronic address:

Rationale And Objectives: To evaluate and compare image quality of different energy levels of virtual monochromatic images (VMIs) using standard versus strong deep learning spectral reconstruction (DLSR) on dual-energy CT pulmonary angiogram (DECT-PA).

Materials And Methods: A retrospective study was performed on 70 patients who underwent DECT-PA (15 PE present; 55 PE absent) scans. VMIs were reconstructed at different energy levels ranging from 35 to 200 keV using standard and strong levels with deep learning spectral reconstruction.

View Article and Find Full Text PDF

Computational Pathology Detection of Hypoxia-Induced Morphological Changes in Breast Cancer.

Am J Pathol

December 2024

Department of Computer Science, Faculty of Engineering Sciences, University College London, Gower Street, London, WC1E 6BT, United Kingdom.

Understanding the tumor hypoxic microenvironment is crucial for grasping tumor biology, clinical progression, and treatment responses. This study presents a novel application of AI in computational histopathology to evaluate hypoxia in breast cancer. Weakly Supervised Deep Learning (WSDL) models can accurately detect morphological changes associated with hypoxia in routine Hematoxylin and Eosin (H&E) whole slide images (WSI).

View Article and Find Full Text PDF

DYT-THAP1 dystonia is a monogenetic form of dystonia, a movement disorder characterized by the involuntary co-contraction of agonistic and antagonistic muscles. The disease is caused by mutations in the THAP1 gene, although the precise mechanisms by which these mutations contribute to the pathophysiology of dystonia remain unclear. The incomplete penetrance of DYT-THAP1 dystonia, estimated at 40 to 60 %, suggests that an environmental trigger may be required for the manifestation of the disease in genetically predisposed individuals.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!