Transcriptome sequencing has led to the widespread identification of long non-coding RNAs (lncRNAs). Subsequently, these genes have been shown to hold functional importance in human cellular biology, which can be exploited by tumors to drive the hallmarks of cancer. Due to the complex tertiary structure and unknown binding motifs of lncRNAs, there is a growing disparity between the number of lncRNAs identified and those that have been functionally characterized. As such, lncRNAs deregulated in cancer may represent critical components of cancer pathways that could serve as novel therapeutic intervention points. Pseudogenes are non-coding DNA sequences that are defunct relatives of their protein-coding parent genes but retain high sequence similarity. Interestingly, certain lncRNAs expressed from pseudogene loci have been shown to regulate the protein-coding parent genes of these pseudogenes in particularly because of this sequence complementarity. We hypothesize that this phenomenon occurs more broadly than previously realized, and that aberrant expression of lncRNAs overlapping pseudogene loci provides an alternative mechanism of cancer gene deregulation. Using RNA-sequencing data from two cohorts of lung adenocarcinoma, each paired with patient-matched non-malignant lung samples, we discovered 104 deregulated pseudogene-derived lncRNAs. Remarkably, many of these deregulated lncRNAs (i) were expressed from the loci of pseudogenes related to known cancer genes, (ii) had expression that significantly correlated with protein-coding parent gene expression, and (iii) had lncRNA protein-coding parent gene expression that was significantly associated with survival. Here, we uncover evidence to suggest the lncRNA-pseudogene-protein-coding gene axis as a prominent mechanism of cancer gene regulation in lung adenocarcinoma, and highlights the clinical utility of exploring the non-coding regions of the cancer transcriptome.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6414417 | PMC |
http://dx.doi.org/10.3389/fgene.2019.00138 | DOI Listing |
Int J Mol Sci
January 2025
Federal Research Center for Innovator and Emerging Biomedical and Pharmaceutical Technologies, 125315 Moscow, Russia.
A pseudogene is a non-functional copy of a protein-coding gene. Processed pseudogenes, which are created by the reverse transcription of mRNA and subsequent integration of the resulting cDNA into the genome, being a major pseudogene class, represent a significant challenge in genome analysis due to their high sequence similarity to the parent genes and their frequent absence in the reference genome. This homology can lead to errors in variant identification, as sequences derived from processed pseudogenes can be incorrectly assigned to parental genes, complicating correct variant calling.
View Article and Find Full Text PDFPLoS Comput Biol
January 2025
Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America.
Motivation: Genome-wide association studies (GWAS) have identified genetic variants, usually single-nucleotide polymorphisms (SNPs), associated with human traits, including disease and disease risk. These variants (or causal variants in linkage disequilibrium with them) usually affect the regulation or function of a nearby gene. A GWAS locus can span many genes, however, and prioritizing which gene or genes in a locus are most likely to be causal remains a challenge.
View Article and Find Full Text PDFFront Cardiovasc Med
November 2024
Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), University of Genoa, Genoa, Italy.
Spontaneous coronary artery dissection (SCAD) is a relevant non-atherosclerotic cause of acute coronary syndrome with a complex genetic architecture. Recent discoveries have highlighted the potential role of miRNAs and protein-coding genes involved in the processing of small RNAs in the pathogenesis of SCAD. Furthermore, there may be a connection between SCAD and the increased cardiovascular risk observed in fragile X premutation carriers as well as a correlation with pathogenetic variants in genes encoding for collagen and extracellular matrix, which are related to connective tissue disorders (CTDs).
View Article and Find Full Text PDFCommun Biol
November 2024
Animal Genomics, ETH Zurich, Zurich, Switzerland.
J Mol Neurosci
November 2024
Department of Child and Adolescent Psychiatry, Faculty of Medicine, Sivas Cumhuriyet University, Sivas, Turkey.
Specific learning disorder (SLD) is prevalent worldwide and is a complex disorder with variable symptoms and significant differences among individuals. Epigenetic markers may alter susceptibility to neurodevelopmental disorders (NDDs). Aberrant expression of protein-coding (mRNA) genes in this pathology shows that the detection of epigenetic molecular biomarkers is of increasing importance in the diagnosis and treatment of individuals with SLD.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!