A Novel Method for Identifying the Potential Cancer Driver Genes Based on Molecular Data Integration.

Biochem Genet

College of Computer Science and Electronics Engineering, Hunan University, Changsha, 410082, Hunan, China.

Published: February 2020

The identification of cancer driver genes is essential for personalized therapy. Most driver genes are mutated at an intermediate (2-20%) or even lower frequency, which makes driver genes with low-frequency mutations difficult to find. Other forms of genomic aberration, such as copy number variations (CNVs) and epigenetic changes, may also reflect cancer progression. In this work, a method for identifying potential cancer driver genes (iPDG) based on molecular data integration is proposed. DNA copy number variation, somatic mutation, and gene expression data from matched cancer samples are integrated. In combination with the iKGGE method, the "key genes" of a cancer are identified, and the change in their expression levels is used as auxiliary evidence of whether the mutated genes are potential drivers. For a mutated gene, the concept of mutational effect is defined, which takes into account the effects of copy number variation, the mutated gene itself, and its neighbor genes. The method consists of two main steps. The first step is data preprocessing: DNA copy number variation and somatic mutation data are integrated, the integrated data are mapped to a given interaction network, and a diffusion kernel is used to form the mutation effect matrix. The second step obtains the key genes with the iKGGE method and constructs the connection matrix from the gene expression data of the key genes and the mutation effect matrix of the mutated genes. Experiments on the TCGA breast cancer and glioblastoma multiforme datasets demonstrate that iPDG is effective not only in identifying known cancer driver genes but also in discovering rare potential driver genes. Functional enrichment analysis shows that these genes are clearly associated with the two cancer types.
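
The preprocessing step described in the abstract can be illustrated with a small sketch. The abstract does not state the integration rule, the kernel form, or any parameter values, so everything specific here is an assumption made for illustration: CNV and mutation calls are merged with a logical OR, the diffusion kernel is taken to be the heat kernel exp(-beta * L) of the graph Laplacian L, and the names `diffusion_kernel`, `mutation_effect`, and `beta` are hypothetical. It is not the authors' implementation.

```python
import numpy as np

def diffusion_kernel(adjacency, beta=0.5):
    """Heat-diffusion kernel K = exp(-beta * L), with L = D - A the graph Laplacian."""
    adjacency = np.asarray(adjacency, dtype=float)
    laplacian = np.diag(adjacency.sum(axis=1)) - adjacency
    # L is symmetric for an undirected network, so its matrix exponential
    # can be computed from the eigendecomposition.
    eigvals, eigvecs = np.linalg.eigh(laplacian)
    return (eigvecs * np.exp(-beta * eigvals)) @ eigvecs.T

def mutation_effect(mutation, cnv, adjacency, beta=0.5):
    """Return a samples x genes matrix of network-smoothed aberration scores."""
    # Assumed integration rule: a gene counts as aberrant in a sample if it
    # carries a somatic mutation OR a copy number change.
    aberration = np.logical_or(mutation, cnv).astype(float)
    # Spread each sample's aberrations to neighboring genes in the network.
    return aberration @ diffusion_kernel(adjacency, beta)

# Toy data: 3 genes on a path g0 - g1 - g2, and 2 samples.
adjacency = np.array([[0, 1, 0],
                      [1, 0, 1],
                      [0, 1, 0]])
mutation = np.array([[1, 0, 0],   # sample 0: g0 mutated
                     [0, 0, 0]])
cnv = np.array([[0, 0, 0],
                [0, 0, 1]])       # sample 1: g2 has a copy number change
effect = mutation_effect(mutation, cnv, adjacency)
```

In this toy example each row of `effect` spreads a single aberration over the path, so the score is highest at the aberrant gene and decays with network distance. This is one way to read the statement that the mutational effect accounts for both a gene and its neighbors.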

Source
http://dx.doi.org/10.1007/s10528-019-09924-2

Publication Analysis

Top Keywords

driver genes: 28
cancer driver: 16
copy number: 16
genes: 13
number variation: 12
method identifying: 8
identifying potential: 8
cancer: 8
potential cancer: 8
based molecular: 8

Similar Publications

This study enrolled 10 patients diagnosed with premalignant lesions and early-stage gastric cardia adenocarcinoma (GCA), confirmed through endoscopic examination. These patients underwent next-generation sequencing (NGS) with a customized 1123-gene panel to identify genetic alterations and signaling pathways. The results were compared with stage IIB to IV GCA samples from The Cancer Genome Atlas (TCGA) and a cohort of Hong Kong patients.

Hepatic stellate cells (HSCs) are key drivers of local fibrosis. Adiponectin, conventionally thought of as an adipokine, is also expressed in quiescent HSCs. However, the impact of its local expression on the progression of liver fibrosis remains unclear.

Transposable elements (TEs) are significant drivers of genome evolution, yet their recent dynamics and impacts within and among species, as well as the roles of host genes and non-coding RNAs in the transposition process, remain elusive. With advancements in large-scale pan-genome sequencing and the development of open data sharing, large-scale comparative genomics studies have become feasible. Here, we performed complete de novo TE annotations and identified active TEs in 310 plant genome assemblies across 119 species and seven crop populations.

Aging and chronic inflammation are associated with overabundant myeloid-primed multipotent progenitors (MPPs) amongst hematopoietic stem and progenitor cells (HSPCs). While HSC differentiation bias has been considered a primary cause of myeloid bias, whether it is sufficient has not been quantitatively evaluated. Here, we analyzed bone marrow data from the IκB- (Nfkbia+/-Nfkbib-/-Nfkbie-/-) mouse model of inflammation with elevated NFκB activity, which shows increased myeloid-biased MPPs.

Lung cancer is one of the most frequently diagnosed cancers in the US. African-American (AA) men are more likely to develop lung cancer with higher incidence and mortality rates than European-American (EA) men. Herein, we report high-confidence alternative splicing (AS) events from high-throughput, high-depth total RNA sequencing of lung tumors and non-tumor adjacent tissues (NATs) in two independent cohorts of patients with adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC).
