Sequencing error profiles of Illumina sequencing instruments.

NAR Genom Bioinform

Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA.

Published: March 2021

Sequencing technology has achieved great advances in the past decade. Studies have previously shown the quality of specific instruments in controlled conditions. Here, we developed a method able to retroactively determine the error rate of most public sequencing datasets. To do this, we utilized the overlaps between reads that are a feature of many sequencing libraries. With this method, we surveyed 1943 different datasets from seven different sequencing instruments produced by Illumina. We show that among public datasets, the more expensive platforms like HiSeq and NovaSeq have a lower error rate and less variation. But we also discovered that there is great variation within each platform, with the accuracy of a sequencing experiment depending greatly on the experimenter. We show the importance of sequence context, especially the phenomenon where preceding bases bias the following bases toward the same identity. We also show the difference in patterns of sequence bias between instruments. Contrary to expectations based on the underlying chemistry, HiSeq X Ten and NovaSeq 6000 share notable exceptions to the preceding-base bias. Our results demonstrate the importance of the specific circumstances of every sequencing experiment, and the importance of evaluating the quality of each one.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8002175PMC
http://dx.doi.org/10.1093/nargab/lqab019DOI Listing

Publication Analysis

Top Keywords

sequencing
8
sequencing instruments
8
error rate
8
sequencing experiment
8
sequencing error
4
error profiles
4
profiles illumina
4
illumina sequencing
4
instruments
4
instruments sequencing
4

Similar Publications

The use of proteins as intracellular probes and therapeutic tools is often limited by poor intracellular delivery. One approach to enabling intracellular protein delivery is to transform proteins into spherical nucleic acid (proSNA) nanoconstructs, with surfaces chemically modified with a dense shell of radially oriented DNA that can engage with cell-surface receptors that facilitate endocytosis. However, proteins often have a limited number of available reactive surface residues for DNA conjugation such that the extent of DNA loading and cellular uptake is restricted.

View Article and Find Full Text PDF

Chemoselectivity in Pd-Based Dyotropic Rearrangement: Development and Application in Total Synthesis of Pheromones.

J Am Chem Soc

January 2025

Laboratory of Synthesis and Natural Products (LSPN), Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne, EPFL-SB-ISIC-LSPN, BCH 5304, CH-1015 Lausanne, Switzerland.

In the dyotropic rearrangement of molecules with semiflexible structures, characterized by a freely rotating static C-C bond, the formation of a mixture of products is common due to the coexistence of several energetically comparable conformers. Herein, we report that it is possible to modulate the shifting groups by adjusting the metal's coordination sphere in Pd-based dyotropic rearrangement. In the presence of a catalytic amount of Pd(II) salt, the reaction of γ-hydroxyalkenes or γ,δ-dihydroxyalkenes with Selectfluor affords fluorinated tetrahydropyranols or 6,8-dioxabicyclo[3.

View Article and Find Full Text PDF

Oral Microbiota Associated with Clinical Efficacy of Ustekinumab in Crohn's Disease.

Endocr Metab Immune Disord Drug Targets

January 2025

Department of Stomatology, The Affiliated Huaian No.1 People's Hospital, Nanjing Medical University, No.1 Huanghe West Road, Huaian, 223300, Jiangsu Province, China.

Background: Crohn's Disease (CD) is a chronic inflammatory gastrointestinal disease. Ustekinumab (UST) has been utilized as a therapeutic option for CD patients. However, approximately 40-60% of patients exhibit an inadequate response to UST.

View Article and Find Full Text PDF

When introduced to multiple distinct ranges, invasive species provide a compelling natural experiment for understanding the repeatability of adaptation. Ambrosia artemisiifolia is an invasive, noxious weed, and chief cause of hay fever. Leveraging over 400 whole-genome sequences spanning the native-range in North America and 2 invasions in Europe and Australia, we inferred demographically distinct invasion histories on each continent.

View Article and Find Full Text PDF

Biphasic Coacervation Controlled by Kinetics as Studied by De Novo-Designed Peptides.

Langmuir

January 2025

Beijing National Laboratory for Molecular Sciences, Department of Polymer Science and Engineering and the Key Laboratory of Polymer Chemistry and Physics of the Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China.

Coacervation is generally treated as a liquid-liquid phase separation process and is controlled mainly by thermodynamics. However, kinetics could make a dominant contribution, especially in systems containing multiple interactions. In this work, using peptides of (XXLY)SSSGSS to tune the charge density and the degree of hydrophobicity, as well as to introduce secondary structures, we evaluated the effect of kinetics on biphasic coacervates formed by peptides with single-stranded oligonucleotides and quaternized dextran at varying pH values.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!