Advances in long-read sequencing technologies and genome assembly methods have enabled the recent completion of the first telomere-to-telomere human genome assembly, which resolves complex segmental duplications and large tandem repeats, including centromeric satellite arrays in a complete hydatidiform mole (CHM13). Although derived from highly accurate sequences, evaluation revealed evidence of small errors and structural misassemblies in the initial draft assembly. To correct these errors, we designed a new repeat-aware polishing strategy that made accurate assembly corrections in large repeats without overcorrection, ultimately fixing 51% of the existing errors and improving the assembly quality value from 70.2 to 73.9 measured from PacBio high-fidelity and Illumina k-mers. By comparing our results to standard automated polishing tools, we outline common polishing errors and offer practical suggestions for genome projects with limited resources. We also show how sequencing biases in both high-fidelity and Oxford Nanopore Technologies reads cause signature assembly errors that can be corrected with a diverse panel of sequencing technologies.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812399PMC
http://dx.doi.org/10.1038/s41592-022-01440-3DOI Listing

Publication Analysis

Top Keywords

sequencing technologies
8
genome assembly
8
assembly
6
errors
5
chasing perfection
4
perfection validation
4
polishing
4
validation polishing
4
polishing strategies
4
strategies telomere-to-telomere
4

Similar Publications

Background: To develop and validate a clinical-radiomics model for preoperative prediction of lymphovascular invasion (LVI) in rectal cancer.

Methods: This retrospective study included data from 239 patients with pathologically confirmed rectal adenocarcinoma from two centers, all of whom underwent MRI examinations. Cases from the first center (n = 189) were randomly divided into a training set and an internal validation set at a 7:3 ratio, while cases from the second center (n = 50) constituted the external validation set.

View Article and Find Full Text PDF

Early cancer detection substantially improves the rate of patient survival; however, conventional screening methods are directed at single anatomical sites and focus primarily on a limited number of cancers, such as gastric, colorectal, lung, breast, and cervical cancer. Additionally, several cancers are inadequately screened, hindering early detection of 45.5% cases.

View Article and Find Full Text PDF

Liaoning cashmere goat is an outstanding breed in China primarily for cashmere production, with strict controls against genetic outflow. Melatonin(MT) is a key factor affecting cashmere growth, and preliminary transcriptome sequencing indicated that melatonin upregulates the expression of the PIP5K1A gene in skin fibroblasts. To predict the physicochemical properties of PIP5K1A in Liaoning cashmere goats, ascertain the tissue localization of PIP5K1A in their skin, and explore the role and mechanism of PIP5K1A in the proliferation of skin fibroblasts.

View Article and Find Full Text PDF

The anti-Stokes emission of photon upconversion nanoparticles (UCNPs) facilitates their use as labels for ultrasensitive detection in biological samples as infrared excitation does not induce autofluorescence at visible wavelengths. The detection of extremely low-abundance analytes, however, remains challenging as it is impossible to completely avoid nonspecific binding of label conjugates. To overcome this limitation, we developed a novel hybridization complex transfer technique using UCNP labels to detect short nucleic acids directly without target amplification.

View Article and Find Full Text PDF

Background: Zostera marina is an important ecosystem engineer influencing shallow water environments and possibly shaping the microbiota in surrounding sediments and water. Z. marina is typically found in marine systems, but it can also proliferate under brackish conditions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!