Ascosphaera apis is a widespread fungal pathogen of honeybee larvae that results in chalkbrood disease, leading to heavy losses for the beekeeping industry in China and many other countries. This work was aimed at generating a full-length transcriptome of A. apis using PacBio single-molecule real-time (SMRT) sequencing. Here, more than 23.97 Gb of clean reads was generated from long-read sequencing of A. apis mycelia, including 464,043 circular consensus sequences (CCS) and 394,142 full-length non-chimeric (FLNC) reads. In total, we identified 174,095 high-confidence transcripts covering 5141 known genes with an average length of 2728 bp. We also discovered 2405 genic loci and 11,623 isoforms that have not been annotated yet within the current reference genome. Additionally, 16,049, 10,682, 4520 and 7253 of the discovered transcripts have annotations in the Non-redundant protein (Nr), Clusters of Eukaryotic Orthologous Groups (KOG), Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Moreover, 1205 long non-coding RNAs (lncRNAs) were identified, which have less exons, shorter exon and intron lengths, shorter transcript lengths, lower GC percent, lower expression levels, and fewer alternative splicing (AS) evens, compared with protein-coding transcripts. A total of 253 members from 17 transcription factor (TF) families were identified from our transcript datasets. Finally, the expression of A. apis isoforms was validated using a molecular approach. Overall, this is the first report of a full-length transcriptome of entomogenous fungi including A. apis. Our data offer a comprehensive set of reference transcripts and hence contributes to improving the genome annotation and transcriptomic study of A. apis.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jip.2020.107475DOI Listing

Publication Analysis

Top Keywords

full-length transcriptome
12
ascosphaera apis
8
apis
7
reconstruction functional
4
functional annotation
4
annotation ascosphaera
4
full-length
4
apis full-length
4
transcriptome utilizing
4
utilizing pacbio
4

Similar Publications

Transcriptional activation of the embryonic genome (EGA) is a major developmental landmark enabling the embryo to become independent from maternal control. The magnitude and control of transcriptional reprogramming during this event across mammals remains poorly understood. Here, we developed Smart-seq+5' for high sensitivity, full-length transcript coverage and simultaneous capture of 5' transcript information from single cells and single embryos.

View Article and Find Full Text PDF

A trigger-inducible split-Csy4 architecture for programmable RNA modulation.

Nucleic Acids Res

January 2025

Research Center for Life Sciences Computing, Zhejiang Lab, Kechuang Avenue, Yuhang District, Hangzhou, Zhejiang, 311121, China.

The CRISPR-derived endoribonuclease Csy4 is a popular tool for controlling transgene expression in various therapeutically relevant settings, but adverse effects potentially arising from non-specific RNA cleavage remains largely unexplored. Here, we report a split-Csy4 architecture that was carefully optimized for in vivo usage. First, we separated Csy4 into two independent protein moieties whose full catalytic activity can be restored via various constitutive or conditional protein dimerization systems.

View Article and Find Full Text PDF

Clair3-RNA: A deep learning-based small variant caller for long-read RNA sequencing data.

bioRxiv

January 2025

Department of Computer Science, School of Computing and Data Science, University of Hong Kong, Hong Kong, China.

Variant calling using long-read RNA sequencing (lrRNA-seq) can be applied to diverse tasks, such as capturing full-length isoforms and gene expression profiling. It poses challenges, however, due to higher error rates than DNA data, the complexities of transcript diversity, RNA editing events, etc. In this paper, we propose Clair3-RNA, the first deep learning-based variant caller tailored for lrRNA-seq data.

View Article and Find Full Text PDF

Genomic insights into fibrinogen-related proteins and expression analysis in the Pacific white shrimp, Litopenaeusvannamei.

Fish Shellfish Immunol

January 2025

Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture (CAS), Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, China; Laboratory for Marine Biology and Biotechnology, Qingdao Marine Science and Technology Center, Qingdao, 266071, China. Electronic address:

Fibrinogen-related domain (FReD) containing proteins are an evolutionarily conserved immune gene family characterized by the C-terminal fibrinogen (FBG) and diverse N-terminal domains. To understand the complexity of this family in crustaceans, we performed genome screening and identified 43 full-length FReDs encoding genes in Litopenaeus vannamei. Structural classification analysis revealed these putative FReDs could be divided into six types, including two reported types (LvFReDI and II) and four new types (LvFReDIII-VI).

View Article and Find Full Text PDF

Chromosome-level genome assembly of the seasonally polyphenic scorpionfly (Panorpa liui).

Sci Data

January 2025

Hubei Insect Resources Utilization and Sustainable Pest Management Key Laboratory, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China.

Mecoptera is a small relict order of insects within the Holometabola. Panorpidae is the most speciose family in Mecoptera. They are also known as scorpion flies due to the enlarged and upward recurved male genital bulb.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!