Increasing read length is currently viewed as the crucial condition for fragment assembly with next-generation sequencing technologies. However, introducing mate-paired reads (separated by a gap of length GapLength) opens a possibility to transform short mate-pairs into long mate-reads of length approximately GapLength, and thus raises the question of whether the read length (as opposed to GapLength) even matters. We describe a new tool, EULER-USR, for assembling mate-paired short reads and use it to analyze whether the read length matters. We further complement the ongoing experimental efforts to maximize read length with a new computational approach for increasing the effective read length. While the common practice is to trim the error-prone tails of the reads, we present an approach that substitutes trimming with error correction using repeat graphs. An important and counterintuitive implication of this result is that one may extend sequencing reactions that degrade with length "past their prime" to where the error rate grows above what is normally acceptable for fragment assembly.
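
The idea of turning a short mate-pair into a long mate-read can be illustrated with a toy example. The sketch below is not EULER-USR's implementation, which the abstract does not detail. It assumes a plain de Bruijn graph built from error-free reads and searches for a single path that starts with one mate, ends with the other, and has a total length close to the expected span. The function names (`build_de_bruijn`, `mate_read`) and the parameters `k`, `min_len`, and `max_len` are hypothetical, introduced only for illustration.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def mate_read(graph, left, right, k, min_len, max_len):
    """Return the unique sequence that starts with `left`, ends with `right`,
    and has length in [min_len, max_len]; return None if there is no such
    path or if more than one exists (an ambiguous mate-pair).

    Assumes len(left) >= k - 1. The search is an exhaustive depth-first
    walk bounded by max_len, which is fine for a toy example only.
    """
    found = []
    stack = [left]
    while stack and len(found) < 2:  # stop early once ambiguity is proven
        seq = stack.pop()
        if len(seq) >= min_len and seq.endswith(right):
            found.append(seq)
        if len(seq) < max_len:
            node = seq[-(k - 1):]
            for nxt in graph.get(node, ()):
                stack.append(seq + nxt[-1])
    return found[0] if len(found) == 1 else None


# Toy data (made up for illustration): 6-base reads tiling a 16-base sequence.
genome = "ACGTTGCATGGACCTA"
reads = [genome[i:i + 6] for i in range(len(genome) - 5)]
graph = build_de_bruijn(reads, k=4)

# The two mates are the first and last 5 bases; the expected span is ~16.
result = mate_read(graph, genome[:5], genome[-5:], k=4, min_len=14, max_len=18)
missing = mate_read(graph, genome[:5], "GGGGG", k=4, min_len=14, max_len=18)
```

In this toy case the graph is a single unbranched path, so the mate-pair resolves to the full 16-base sequence. In a real genome, repeats create branches, and a mate-pair is only transformed when the length constraint leaves exactly one consistent path.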

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652199
DOI: http://dx.doi.org/10.1101/gr.079053.108
