TransIntegrator: capture nearly full protein-coding transcript variants via integrating Illumina and PacBio transcriptomes.

Brief Bioinform

State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, 361102, Xiamen, China.

Published: September 2023

Genes can produce transcript variants that perform specific cellular functions. However, accurately detecting all transcript variants remains a long-standing challenge, especially when working with poorly annotated genomes or without a reference genome. To address this issue, we developed a new computational method, TransIntegrator, which enables transcriptome-wide detection of novel transcript variants. To this end, we sequenced 10 Illumina transcriptomes and one PacBio full-length transcriptome covering consecutive embryonic development stages of amphioxus, a species of great evolutionary importance. Based on these transcriptomes, we used TransIntegrator to build a comprehensive transcript variant library, termed iTranscriptome. The resulting iTranscriptome contained 91 915 distinct transcript variants, with an average of 2.4 variants per gene. This substantially improved the current amphioxus genome annotation, expanding the number of genes from 21 954 to 38 777. Further analysis showed that the gene expansion was largely attributable to the integration of multiple Illumina datasets rather than to the inclusion of the PacBio data. Moreover, we demonstrated an example application of TransIntegrator, via generating iTranscriptome, in aiding accurate transcriptome assembly, where it significantly outperformed other hybrid methods such as IDP-denovo and Trinity. For user convenience, we have deposited the source code of TransIntegrator on GitHub and released a conda package on Anaconda. In summary, this study proposes an affordable yet efficient method for reliable transcriptomic research in most species.
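The headline figures in the abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers quoted above; the variable names are illustrative, and it assumes the reported average of 2.4 variants per gene is computed over the expanded gene count (38 777), which the abstract does not state explicitly.

```python
# Figures quoted in the abstract.
transcript_variants = 91_915   # distinct transcript variants in iTranscriptome
genes_before = 21_954          # genes in the existing amphioxus annotation
genes_after = 38_777           # genes after the annotation was expanded

# Assumption: the average is taken over the expanded gene set.
variants_per_gene = transcript_variants / genes_after
new_genes = genes_after - genes_before
fold_increase = genes_after / genes_before

print(f"variants per gene: {variants_per_gene:.1f}")
print(f"genes added: {new_genes}")
print(f"fold increase in gene count: {fold_increase:.2f}")
```

Under that assumption the ratio rounds to the reported 2.4; dividing by the original 21 954 genes instead would give a value above 4, so the expanded gene set is the more plausible denominator.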

Source
http://dx.doi.org/10.1093/bib/bbad334

Similar Publications

A genomic variation map provides insights into potato evolution and key agronomic traits.

Mol Plant

January 2025

Inner Mongolia Potato Engineering and Technology Research Centre, Key Laboratory of Herbage and Endemic Crop Biology, Ministry of Education, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China.

Hybrid potato breeding based on diploid inbred lines is transforming the genetic improvement of this staple food crop, and it requires a deep understanding of potato domestication and differentiation. Here, we resequenced 314 diploid wild and landrace accessions to generate a variome map of 47,203,407 variants. Using the variome map, we discovered the reshaping of the tuber transcriptome during potato domestication, characterized genome-wide differentiation between the landrace groups Stenotomum and Phureja, and identified a jasmonic acid biosynthetic gene that possibly affects the tuber dormancy period.

Dynamic Roles of RNA and RNA Epigenetics in HTLV-1 Biology.

Viruses

January 2025

Center for Retrovirus Research, Department of Veterinary Biosciences, The Ohio State University, Columbus, OH 43210, USA.

Since the discovery of RNA in the early 1900s, scientific understanding of RNA form and function has evolved beyond protein coding. Viruses, particularly retroviruses like human T-cell leukemia virus type 1 (HTLV-1), rely heavily on RNA and RNA post-transcriptional modifications to regulate the viral lifecycle, pathogenesis, and evasion of host immune responses. With the emergence of new sequencing technologies in the last decade, our ability to dissect the intricacies of RNA has flourished.

Patterns of Isoform Variation for N Gene Subgenomic mRNAs in Betacoronavirus Transcriptomes.

Viruses

December 2024

Department of Biology, Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA.

The nucleocapsid (N) protein is the most expressed protein in later stages of SARS-CoV-2 infection with several important functions. It is translated from a subgenomic mRNA (sgmRNA) formed by template switching during transcription. A recently described translation initiation site (TIS) with a CTG codon in the leader sequence (TIS-L) is out of frame with most structural and accessory genes including the N gene and may act as a translation suppressor.

Analyzing the genetic architecture of hereditary forms of diabetes in different populations is a critical step toward optimizing diagnostic and preventive algorithms. This requires consideration of regional and population-specific characteristics, including the spectrum and frequency of pathogenic variants in targeted genes. As part of this study, we used a custom-designed NGS panel to screen for mutations in 28 genes associated with the pathogenesis of hereditary diabetes mellitus in 506 unrelated patients from Russia.

Identification and Functional Analysis of Candidate Genes Influencing Citrus Leaf Size Through Transcriptome and Coexpression Network Approaches.

Genes (Basel)

January 2025

Guangxi Key Laboratory of Germplasm Innovation and Utilization of Specialty Commercial Crops in North Guangxi, Guangxi Citrus Breeding and Cultivation Technology Innovation Center, Guangxi Academy of Specialty Crops, Guilin 541004, China.

Background: Leaves are the main organs involved in photosynthesis. They capture light energy and promote gas exchange, and their size and shape affect yield. Identifying the regulatory networks and key genes that control citrus leaf size is essential for increasing citrus crop yield.
