AI Article Synopsis

  • RNA-Seq data varies in transcript abundance, complicating decisions on the amount of sequencing needed for less expressed genes and the impact of sequencing technology biases.
  • Analysis of high-depth libraries shows that exomic sequence assembly levels off at around 2 to 8 Gbp, while genomic sequence assembly continues to improve across various species.
  • Both Illumina HiSeq and MGI DNBseq™ technologies recover a similar number of full-length transcripts, though HiSeq may miss some regions due to GC content issues, potentially linked to library preparation.

Article Abstract

Background: RNA-Seq data is inherently nonuniform for different transcripts because of differences in gene expression. This makes it challenging to decide how much data should be generated from each sample. How much should one spend to recover the less expressed transcripts? The sequencing technology used is another consideration, as there are inevitably always biases against certain sequences. To investigate these effects, we first looked at high-depth libraries from a set of well-annotated organisms to ascertain the impact of sequencing depth on de novo assembly. We then looked at libraries sequenced from the Universal Human Reference RNA (UHRR) to compare the performance of Illumina HiSeq and MGI DNBseq™ technologies.

Results: On the issue of sequencing depth, the amount of exomic sequence assembled plateaued using data sets of approximately 2 to 8 Gbp. However, the amount of genomic sequence assembled did not plateau for many of the analyzed organisms. Most of the unannotated genomic sequences are single-exon transcripts whose biological significance will be questionable for some users. On the issue of sequencing technology, both of the analyzed platforms recovered a similar number of full-length transcripts. The missing "gap" regions in the HiSeq assemblies were often attributed to higher GC contents, but this may be an artefact of library preparation and not of sequencing technology.

Conclusions: Increasing sequencing depth beyond modest data sets of less than 10 Gbp recovers a plethora of single-exon transcripts undocumented in genome annotations. DNBseq™ is a viable alternative to HiSeq for de novo RNA-Seq assembly.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6651908PMC
http://dx.doi.org/10.1186/s12864-019-5965-xDOI Listing

Publication Analysis

Top Keywords

sequencing depth
16
impact sequencing
8
novo rna-seq
8
rna-seq assembly
8
sequencing technology
8
issue sequencing
8
sequence assembled
8
data sets
8
sets gbp
8
single-exon transcripts
8

Similar Publications

Stepwise Modulation of Bridged Single-Benzene-Based Fluorophores for Materials Science.

Chemistry

December 2024

Universitat Duisburg-Essen, Institute of organic chemistry, Universitätsstraße 7, 45117, Essen, GERMANY.

In recent years, researchers studying fluorogenic samples have steadily shifted from using large, expensive, poorly soluble fluorophores with complex synthetic sequences to smaller, simpler p scaffolds with low molecular weight. This research article presents an in-depth study of the photophysical properties of five bridged single-benzene-based fluorophores (SBBFs) investigated for their solution and solid-state emission (SSSE) properties. The compounds O4, N1O3, N2O2, N3O1, and N4 are derived from a central terephthalonitrile core and vary in the amount of oxygen and nitrogen bridging atoms.

View Article and Find Full Text PDF

The genome-wide chromosome conformation capture method, Hi-C, has greatly advanced our understanding of genome organization. However, its quantitative properties, including sensitivity, bias, and linearity, remain challenging to assess. Measuring these properties is difficult due to the heterogenous and dynamic nature of chromosomal interactions.

View Article and Find Full Text PDF

Discovery of myosin light chain kinase gene variant in a patient with tetralogy of Fallot suffering aortic dissection: Implications for pathogenesis and the role of family and population screening.

Int J Cardiol Congenit Heart Dis

December 2024

Department of Cardiovascular Sciences and NIHR Leicester Biomedical Research Centre, University of Leicester, College of Medicine Biological Sciences and Psychology, Glenfield Hospital, Groby Road LE39QP, Leicester, UK.

Background: Thoracic aortic dissection (TAD) is an uncommon complication in patients with Tetralogy of Fallot (TOF). Information concerning risk factors for TAD in patients with TOF is very limited.

Methods: We report a case of Stanford type A TAD in a female patient with previously repaired TOF.

View Article and Find Full Text PDF

This study investigated age-related changes in the gut microbiota and metabolome of Sapsaree dogs through metagenomic and metabolomic analyses. Using Illumina (short-read) and Nanopore (long-read) sequencing technologies, we identified both common and unique bacterial genera in the dogs across different age groups. In metagenomic analysis, Firmicutes were predominant at the family level.

View Article and Find Full Text PDF

Background: The gene C9orf72 harbors a non-coding hexanucleotide repeat expansion known to cause amyotrophic lateral sclerosis and frontotemporal dementia. While previous studies have estimated the length of this repeat expansion in multiple tissues, technological limitations have impeded researchers from exploring additional features, such as methylation levels.

Methods: We aimed to characterize C9orf72 repeat expansions using a targeted, amplification-free long-read sequencing method.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!

A PHP Error was encountered

Severity: Notice

Message: fwrite(): Write of 34 bytes failed with errno=28 No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 272

Backtrace:

A PHP Error was encountered

Severity: Warning

Message: session_write_close(): Failed to write session data using user defined save handler. (session.save_path: /var/lib/php/sessions)

Filename: Unknown

Line Number: 0

Backtrace: