Wiley Interdiscip Rev RNA
April 2024
For a long time, it was believed that new genes arise only from modifications of preexisting genes, but the discovery of de novo protein-coding genes that originated from noncoding DNA regions demonstrates the existence of a "motherless" origination process for new genes. However, the features, distributions, expression profiles, and origin modes of these genes in humans seem to support the notion that their origin is not a purely "motherless" process; rather, these genes arise preferentially from genomic regions encoding preexisting precursors with gene-like features. In such a case, the gene loci are typically not brand new.
View Article and Find Full Text PDFWhile -methyldeoxyadenine (6mA) modification is a fundamental regulation in prokaryotes, its prevalence and functions in eukaryotes are controversial. Here, we report 6mA-Sniper to quantify 6mA sites in eukaryotes at single-nucleotide resolution, and delineate a 6mA profile in with 2034 sites. Twenty-six of 39 events with Mnl I restriction endonuclease sites were verified, demonstrating the feasibility of this method.
View Article and Find Full Text PDFNewly originated de novo genes have been linked to the formation and function of the human brain. However, how a specific gene originates from ancestral noncoding DNAs and becomes involved in the preexisting network for functional outcomes remains elusive. Here, a human-specific de novo gene, SP0535, is identified that is preferentially expressed in the ventricular zone of the human fetal brain and plays an important role in cortical development and function.
View Article and Find Full Text PDFHuman de novo genes can originate from neutral long non-coding RNA (lncRNA) loci and are evolutionarily significant in general, yet how and why this all-or-nothing transition to functionality happens remains unclear. Here, in 74 human/hominoid-specific de novo genes, we identified distinctive U1 elements and RNA splice-related sequences accounting for RNA nuclear export, differentiating mRNAs from lncRNAs, and driving the origin of de novo genes from lncRNA loci. The polymorphic sites facilitating the lncRNA-mRNA conversion through regulating nuclear export are selectively constrained, maintaining a boundary that differentiates mRNAs from lncRNAs.
View Article and Find Full Text PDFRhesus macaque is a unique nonhuman primate model for human evolutionary and translational study, but the error-prone gene models critically limit its applications. Here, we de novo defined full-length macaque gene models based on single molecule, long-read transcriptome sequencing in four macaque tissues (frontal cortex, cerebellum, heart and testis). Overall, 8 588 227 poly(A)-bearing complementary DNA reads with a mean length of 14 106 nt were generated to compile the backbone of macaque transcripts, with the fine-scale structures further refined by RNA sequencing and cap analysis gene expression sequencing data.
View Article and Find Full Text PDFBackground: Recent studies have revealed thousands of A-to-I RNA editing events in primates, but the origination and general functions of these events are not well addressed.
Results: Here, we perform a comparative editome study in human and rhesus macaque and uncover a substantial proportion of macaque A-to-I editing sites that are genomically polymorphic in some animals or encoded as non-editable nucleotides in human. The occurrence of these recent gain and loss of RNA editing through DNA point mutation is significantly more prevalent than that expected for the nearby regions.
Nucleosomal modifications have been implicated in fundamental epigenetic regulation, but the roles of nucleosome occupancy in shaping changes through evolution remain to be addressed. Here we present high-resolution nucleosome occupancy profiles for multiple tissues derived from human, macaque, tree shrew, mouse, and pig. Genome-wide comparison reveals conserved nucleosome occupancy profiles across both different species and tissue types.
View Article and Find Full Text PDF