A correct genome annotation is fundamental for research in the field of molecular and structural biology. The annotation of the reference genome of has been reported previously, but it is essentially limited to open reading frames (ORFs) of protein coding genes and contains only a few noncoding transcripts. In this study, we identified and annotated full-length transcripts of by deep RNA sequencing. We annotated 7044 coding genes and 4567 noncoding genes. Astonishingly, 23% of the coding genes are alternatively spliced. We identified 679 novel coding genes as well as 2878 novel noncoding genes and corrected the structural organization of more than 50% of the previously annotated genes. Furthermore, we substantially extended the Gene Ontology (GO) and Enzyme Commission (EC) lists, which provide comprehensive search tools for potential industrial applications and basic research. The identified novel transcripts and improved annotation will help to understand the gene regulatory landscape in The analysis pipeline developed here can be used to build transcriptome assemblies and identify coding and noncoding RNAs of other species.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535861PMC
http://dx.doi.org/10.3390/genes12101549DOI Listing

Publication Analysis

Top Keywords

coding genes
16
noncoding genes
8
genes
7
coding
5
global transcriptome
4
transcriptome characterization
4
characterization assembly
4
assembly thermophilic
4
thermophilic ascomycete
4
ascomycete correct
4

Similar Publications

Female mosquitoes require a vertebrate blood meal to activate reproduction, transmitting numerous devastating human diseases. Vitellogenesis is a central event of female reproduction that involves the massive production of vitellogenin (Vg) in the fat body and the maturation of ovaries. This process is controlled by the steroid hormone 20-hydroxyecdysone (20E); however, its molecular regulatory basis remains not completely understood.

View Article and Find Full Text PDF

Transposable elements (TEs) are significant drivers of genome evolution, yet their recent dynamics and impacts within and among species, as well as the roles of host genes and non-coding RNAs in the transposition process, remain elusive. With advancements in large-scale pan-genome sequencing and the development of open data sharing, large-scale comparative genomics studies have become feasible. Here, we performed complete de novo TE annotations and identified active TEs in 310 plant genome assemblies across 119 species and seven crop populations.

View Article and Find Full Text PDF

We present the complete mitochondrial genome of from China. The mitogenome of is circular, AT-rich (75.3%), and 15,898 bp in length.

View Article and Find Full Text PDF

Alternative splicing is essential for the generation of various protein isoforms that are involved in cell differentiation and tissue development. In addition to internal coding exons, alternative splicing affects the exons with translation initiation codons; however, little is known about these exons. Here, we performed a systematic classification of human alternative exons using coding information.

View Article and Find Full Text PDF

We describe the phenotypic and genotypic spectrum of patients with vascular anomaly (VA) in a paediatric multi-disciplinary VA clinic. We measured the clinical utility of genotyping by comparing pre and posttest diagnosis and management. A 46-month retrospective analysis occurred for 250 patients offered genetic testing in the VA clinic.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!