Transcriptome sequencing has led to the widespread identification of long non-coding RNAs (lncRNAs). Subsequently, these genes have been shown to hold functional importance in human cellular biology, which can be exploited by tumors to drive the hallmarks of cancer. Due to the complex tertiary structure and unknown binding motifs of lncRNAs, there is a growing disparity between the number of lncRNAs identified and those that have been functionally characterized. As such, lncRNAs deregulated in cancer may represent critical components of cancer pathways that could serve as novel therapeutic intervention points. Pseudogenes are non-coding DNA sequences that are defunct relatives of their protein-coding parent genes but retain high sequence similarity. Interestingly, certain lncRNAs expressed from pseudogene loci have been shown to regulate the protein-coding parent genes of these pseudogenes in particularly because of this sequence complementarity. We hypothesize that this phenomenon occurs more broadly than previously realized, and that aberrant expression of lncRNAs overlapping pseudogene loci provides an alternative mechanism of cancer gene deregulation. Using RNA-sequencing data from two cohorts of lung adenocarcinoma, each paired with patient-matched non-malignant lung samples, we discovered 104 deregulated pseudogene-derived lncRNAs. Remarkably, many of these deregulated lncRNAs (i) were expressed from the loci of pseudogenes related to known cancer genes, (ii) had expression that significantly correlated with protein-coding parent gene expression, and (iii) had lncRNA protein-coding parent gene expression that was significantly associated with survival. Here, we uncover evidence to suggest the lncRNA-pseudogene-protein-coding gene axis as a prominent mechanism of cancer gene regulation in lung adenocarcinoma, and highlights the clinical utility of exploring the non-coding regions of the cancer transcriptome.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6414417PMC
http://dx.doi.org/10.3389/fgene.2019.00138DOI Listing

Publication Analysis

Top Keywords

protein-coding parent
16
mechanism cancer
12
cancer gene
12
lung adenocarcinoma
12
lncrnas
9
aberrant expression
8
pseudogene-derived lncrnas
8
alternative mechanism
8
cancer
8
gene regulation
8

Similar Publications

A pseudogene is a non-functional copy of a protein-coding gene. Processed pseudogenes, which are created by the reverse transcription of mRNA and subsequent integration of the resulting cDNA into the genome, being a major pseudogene class, represent a significant challenge in genome analysis due to their high sequence similarity to the parent genes and their frequent absence in the reference genome. This homology can lead to errors in variant identification, as sequences derived from processed pseudogenes can be incorrectly assigned to parental genes, complicating correct variant calling.

View Article and Find Full Text PDF

Motivation: Genome-wide association studies (GWAS) have identified genetic variants, usually single-nucleotide polymorphisms (SNPs), associated with human traits, including disease and disease risk. These variants (or causal variants in linkage disequilibrium with them) usually affect the regulation or function of a nearby gene. A GWAS locus can span many genes, however, and prioritizing which gene or genes in a locus are most likely to be causal remains a challenge.

View Article and Find Full Text PDF

Genetics architecture of spontaneous coronary artery dissection in an Italian cohort.

Front Cardiovasc Med

November 2024

Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), University of Genoa, Genoa, Italy.

Spontaneous coronary artery dissection (SCAD) is a relevant non-atherosclerotic cause of acute coronary syndrome with a complex genetic architecture. Recent discoveries have highlighted the potential role of miRNAs and protein-coding genes involved in the processing of small RNAs in the pathogenesis of SCAD. Furthermore, there may be a connection between SCAD and the increased cardiovascular risk observed in fragile X premutation carriers as well as a correlation with pathogenetic variants in genes encoding for collagen and extracellular matrix, which are related to connective tissue disorders (CTDs).

View Article and Find Full Text PDF
Article Synopsis
  • * A deletion in the THRSP gene, related to thyroid hormone response, was found in both wisent and bison genomes but not in other cattle species, suggesting that bison may lack this important protein.
  • * The study illustrates how super-pangenomes can help identify genetic variations linked to traits across species, while also highlighting challenges in accurately assembling genomes from species that have experienced population bottlenecks.
View Article and Find Full Text PDF

Specific learning disorder (SLD) is prevalent worldwide and is a complex disorder with variable symptoms and significant differences among individuals. Epigenetic markers may alter susceptibility to neurodevelopmental disorders (NDDs). Aberrant expression of protein-coding (mRNA) genes in this pathology shows that the detection of epigenetic molecular biomarkers is of increasing importance in the diagnosis and treatment of individuals with SLD.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!