AI Article Synopsis

  • - ONT's long-read sequencing allows for direct sequencing of epigenetic modifications but has lower accuracy, necessitating improved basecalling methods by utilizing species-specific models.
  • - Research involved testing ONT's sequencing on two plants using both ONT PromethION and PacBio Sequel II HiFi technologies, showing better accuracy with species-specific models and improved flowcells.
  • - Results indicated that though ONT Guppy versions yielded high read accuracies, using mixed-species models potentially lowers overall accuracy, suggesting the need for tailored models for each species for optimal results.

Article Abstract

Background: Long-read sequencing platforms offered by Oxford Nanopore Technologies (ONT) allow native DNA containing epigenetic modifications to be directly sequenced, but can be limited by lower per-base accuracies. A key step post-sequencing is basecalling, the process of converting raw electrical signals produced by the sequencing device into nucleotide sequences. This is challenging as current basecallers are primarily based on mixtures of model species for training. Here we utilise both ONT PromethION and higher accuracy PacBio Sequel II HiFi sequencing on two plants, Phebalium stellatum and Xanthorrhoea johnsonii, to train species-specific basecaller models with the aim of improving per-base accuracy. We investigate sequencing accuracies achieved by ONT basecallers and assess accuracy gains by training single-species and species-specific basecaller models. We also evaluate accuracy gains from ONT's improved flowcells (R10.4, FLO-PRO112) and sequencing kits (SQK-LSK112). For the truth dataset for both model training and accuracy assessment, we developed highly accurate, contiguous diploid reference genomes with PacBio Sequel II HiFi reads.

Results: Basecalling with ONT Guppy 5 and 6 super-accurate gave almost identical results, attaining read accuracies of 91.96% and 94.15%. Guppy's plant-specific model gave highly mixed results, attaining read accuracies of 91.47% and 96.18%. Species-specific basecalling models improved read accuracy, attaining 93.24% and 95.16% read accuracies. R10.4 sequencing kits also improve sequencing accuracy, attaining read accuracies of 95.46% (super-accurate) and 96.87% (species-specific).

Conclusions: The use of a single mixed-species basecaller model, such as ONT Guppy super-accurate, may be reducing the accuracy of nanopore sequencing, due to conflicting genome biology within the training dataset and study species. Training of single-species and genome-specific basecaller models improves read accuracy. Studies that aim to do large-scale long-read genotyping would primarily benefit from training their own basecalling models. Such studies could use sequencing accuracy gains and improving bioinformatics tools to improve study outcomes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749173PMC
http://dx.doi.org/10.1186/s13007-022-00971-2DOI Listing

Publication Analysis

Top Keywords

read accuracies
16
basecaller models
12
accuracy gains
12
attaining read
12
accuracy
11
sequencing
10
accuracy nanopore
8
nanopore sequencing
8
sequencing plants
8
species training
8

Similar Publications

Background: This retrospective study aims to evaluate the impact of a content-based image retrieval (CBIR) application on diagnostic accuracy and confidence in interstitial lung disease (ILD) assessment using high-resolution computed tomography CT (HRCT).

Methods: Twenty-eight patients with verified pattern-based ILD diagnoses were split into two equal datasets (1 and 2). The images were assessed by two radiology residents (3rd and 5th year) and one expert radiologist in four sessions.

View Article and Find Full Text PDF

Objective: There is limited research on weight bias in diagnosing eating disorders (EDs), particularly among healthcare professionals (HCPs). This is especially true for atypical anorexia nervosa, a diagnosis recently described in the DSM that includes people with anorexia nervosa symptoms who are not clinically underweight.

Method: Using a within-subjects design, we assessed diagnosis, diagnostic confidence, and ED-related medical knowledge among a sample of lay people and medical professionals.

View Article and Find Full Text PDF

Approach to Publishing a Scientific (Radiology) Book.

Indian J Radiol Imaging

January 2025

Department of Radiodiagnosis and Interventional Radiology, All India Institute of Medical Sciences, New Delhi, India.

In the modern landscape of information technology, the role of books remains pivotal in education and research, especially in scientific fields such as radiology. This article outlines a comprehensive approach to publishing a scientific book in radiology, from the initial concept to distribution and ongoing updates. The process is influenced by factors such as the author's motivation, expertise, and target audience.

View Article and Find Full Text PDF

Objective: We aimed to evaluate the content and quality of websites for consumers providing information about human papillomavirus (HPV) risks in patients with systemic lupus erythematosus (SLE).

Methods: We conducted an environmental scan of websites for patients and the general public with information about HPV and SLE. We searched Google from inception to June 2023, using the terms "HPV" and "lupus".

View Article and Find Full Text PDF

With a focus on content-area reading, this study aimed to (a) understand the sources and prevalence of concurrent and specific difficulties in word-level skills, vocabulary, and knowledge among adolescent struggling readers (ASRs) and (b) explore the relations among reading skills, profiles, and reading comprehension. A dual-measure screening approach was used to classify a sample of 492 seventh- and eighth-graders. Among the subgroup of 225 ASRs, five distinct profiles were identified by latent profile analysis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!