New protein-coding genes can evolve from previously noncoding genomic regions through a process known as de novo gene emergence. Evidence suggests that this process has likely occurred throughout evolution and across the tree of life. Yet, confidently identifying de novo emerged genes remains challenging.
View Article and Find Full Text PDFSpecies-specific genes, also known as orphans, are ubiquitous across life's domains. In prokaryotes, species-specific orphan genes (SSOGs) are mostly thought to originate in external elements such as viruses followed by horizontal gene transfer, whereas the scenario of native origination, through rapid divergence or de novo, is mostly dismissed. However, quantitative evidence supporting either scenario is lacking.
View Article and Find Full Text PDFGene expression is an essential step in the translation of genotypes into phenotypes. However, little is known about the transcriptome architecture and the underlying genetic effects at the species level. Here we generated and analyzed the pan-transcriptome of ~1,000 yeast natural isolates across 4,977 core and 1,468 accessory genes.
View Article and Find Full Text PDFCasposons are transposable elements containing the CRISPR associated gene Cas1solo. Identified in many archaeal genomes, casposons are discussed as the origin of CRISPR-Cas systems due to their proposed Cas1solo-dependent translocation. However, apart from bioinformatic approaches and the demonstration of Cas1solo integrase and endonuclease activity in vitro, casposon transposition has not yet been shown in vivo.
View Article and Find Full Text PDFIntergenic genomic regions have essential regulatory and structural roles that impose constraints on their sequences. But regions that do not currently encode proteins also carry the potential to do so in the future. De novo gene emergence, the evolution of novel genes out of previously noncoding sequences has now been established as a potent force for genomic novelty.
View Article and Find Full Text PDFSmall open reading frames (sORFs) can encode functional "microproteins" that perform crucial biological tasks. However, their size makes them less amenable to genomic analysis, and their origins and conservation are poorly understood. Given their short length, it is plausible that some of these functional microproteins have recently originated entirely de novo from noncoding sequences.
View Article and Find Full Text PDFNearly one third of protein coding sequences correspond to duplicate genes, equally split between small-scale duplicates (SSD) and whole-genome duplicates (WGD). While duplicate genes have distinct properties compared to singletons, to date, there has been no systematic analysis of their positional preferences. In this work, we show that SSD and WGD genes are organized in distinct gene clusters that occupy different genomic regions, with SSD being more peripheral and WGD more centrally positioned close to centromeric chromatin.
View Article and Find Full Text PDFHigh-throughput sequencing (HTS) technologies and bioinformatic analyses are of growing interest to be used as a routine diagnostic tool in the field of plant viruses. The reliability of HTS workflows from sample preparation to data analysis and results interpretation for plant virus detection and identification must be evaluated (verified and validated) to approve this tool for diagnostics. Many different extraction methods, library preparation protocols, and sequence and bioinformatic pipelines are available for virus sequence detection.
View Article and Find Full Text PDFThe Plasma membrane Cation binding Protein 1 (PCaP1) has been shown to be important for the intra-cellular movement of two members of the Potyvirus genus in arabidopsis and tobacco plants. In this study, the orthologous PCaP1 gene of pepper (Capsicum annuum) was examined for its role in the accumulation of Potato virus Y, type member of the Potyvirus. Downregulation of C.
View Article and Find Full Text PDFThe origin of 'orphan' genes, species-specific sequences that lack detectable homologues, has remained mysterious since the dawn of the genomic era. There are two dominant explanations for orphan genes: complete sequence divergence from ancestral genes, such that homologues are not readily detectable; and de novo emergence from ancestral non-genic sequences, such that homologues genuinely do not exist. The relative contribution of the two processes remains unknown.
View Article and Find Full Text PDFRecent evidence demonstrates that novel protein-coding genes can arise de novo from non-genic loci. This evolutionary innovation is thought to be facilitated by the pervasive translation of non-genic transcripts, which exposes a reservoir of variable polypeptides to natural selection. Here, we systematically characterize how these de novo emerging coding sequences impact fitness in budding yeast.
View Article and Find Full Text PDFThe sequencing of over a thousand Saccharomyces cerevisiae genomes revealed a complex pangenome. Over one third of the discovered genes are not present in the S. cerevisiae core genome but instead are often restricted to a subset of yeast isolates and thus may be important for adaptation to specific environmental niches.
View Article and Find Full Text PDFDe novo genes, that is, protein-coding genes originating from previously noncoding sequence, have gone from being considered impossibly unlikely to being recognized as an important source of genetic novelty in eukaryotic genomes. It is clear that de novo gene evolution is a rare but consistent feature of eukaryotic genomes, being detected in every genome studied. However, different studies often use different computational methods, and the numbers and identities of the detected genes vary greatly.
View Article and Find Full Text PDFReconstructing genome history is complex but necessary to reveal quantitative principles governing genome evolution. Such reconstruction requires recapitulating into a single evolutionary framework the evolution of genome architecture and gene repertoire. Here, we reconstructed the genome history of the genus Lachancea that appeared to cover a continuous evolutionary range from closely related to more diverged yeast species.
View Article and Find Full Text PDF