We propose a novel two-stage analysis strategy to discover candidate genes associated with the particular cancer outcomes in large multimodal genomic cancers databases, such as The Cancer Genome Atlas (TCGA). During the first stage, we use mixed mutual information to perform variable selection; during the second stage, we use scalable Bayesian network (BN) modeling to identify candidate genes and their interactions. Two crucial features of the proposed approach are (i) the ability to handle mixed data types (continuous and discrete, genomic, epigenomic, etc.) and (ii) a flexible boundary between the variable selection and network modeling stages - the boundary that can be adjusted in accordance with the investigators' BN software scalability and hardware implementation. These two aspects result in high generalizability of the proposed analytical framework. We apply the above strategy to three different TCGA datasets (LGG, Brain Lower Grade Glioma; HNSC, Head and Neck Squamous Cell Carcinoma; STES, Stomach and Esophageal Carcinoma), linking multimodal molecular information (SNPs, mRNA expression, DNA methylation) to two clinical outcome variables (tumor status and patient survival). We identify 11 candidate genes, of which 6 have already been directly implicated in the cancer literature. One novel LGG prognostic factor suggested by our analysis, methylation of TMPRSS11F type II transmembrane serine protease, presents intriguing direction for the follow-up studies.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7314938 | PMC |
http://dx.doi.org/10.3389/fgene.2020.00648 | DOI Listing |
Sleep Breath
January 2025
Department of Respiratory and Critical Care Medicine, Medical School of Nantong University, Nantong Key Laboratory of Respiratory Medicine, Affiliated Hospital of Nantong University, Nantong, 226001, China.
Background: The pathophysiology of obstructive sleep apnea (OSA) and diabetes mellitus (DM) is still unknown, despite clinical reports linking the two conditions. After investigating potential roles for DM-related genes in the pathophysiology of OSA, our goal is to investigate the molecular significance of the condition. Machine learning is a useful approach to understanding complex gene expression data to find biomarkers for the diagnosis of OSA.
View Article and Find Full Text PDFAppl Microbiol Biotechnol
January 2025
Department of Microbiology and Biotechnology, Institute of Plant Science and Microbiology, University of Hamburg, Ohnhorststr.18, 22609, Hamburg, Germany.
The focus on microalgae for applications in several fields, e.g. resources for biofuel, the food industry, cosmetics, nutraceuticals, biotechnology, and healthcare, has gained increasing attention over the last decades.
View Article and Find Full Text PDFPlant Genome
March 2025
Key Laboratory of Plant Functional Genomics of the Ministry of Education/Jiangsu Key Laboratory of Crop Genomics and Molecular Breeding/Jiangsu Co-Innovation Center for Modern Production Technology of Grain Crops/Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou, China.
Winter barley (Hordeum vulgare) production areas in the middle and lower reaches of the Yangtze River are severely threatened by barley yellow mosaic disease, which is caused by Barley yellow mosaic virus and Barley mild mosaic virus. Improving barley disease resistance in breeding programs requires knowledge of genetic loci in germplasm resources. In this study, bulked segregant analysis (BSA) identified a novel major quantitative trait loci (QTL) QRym.
View Article and Find Full Text PDFCytotechnology
February 2025
Department of Neurology, Hubei Provincial Hospital of Integrated Traditional and Western Medicine, Jianghan District, No. 11 Lingjiaohu Road, Wuhan, 430015 China.
Unlabelled: Alzheimer's disease (AD) is a progressive neurological condition that causes brain shrinkage and cell death. This study aimed to identify the role of the NORAD/miR-26b-5p axis in AD. StarBase was used to examine the binding sequences of miR-26b-5p to LncRNA NORAD or its target genes, which were verified by a double luciferase reporter assay.
View Article and Find Full Text PDFChoosing the appropriate reference genes for quantitative real-time PCR (qRT-PCR) is very important for accurately evaluating expression of target genes. L. is a widely used horticultural plant with high ornamental value, which also shows a strong ability to tolerate abiotic stresses.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!