Although current long-read sequencing technologies have a long-read length that facilitates assembly for genome reconstruction, they have high sequence errors. While various assemblers with different perspectives have been developed, no systematic evaluation of assemblers with long reads for diploid genomes with varying heterozygosity has been performed. Here, we evaluated a series of processes, including the estimation of genome characteristics such as genome size and heterozygosity, de novo assembly, polishing, and removal of allelic contigs, using six genomes with various heterozygosity levels. We evaluated five long-read-only assemblers (Canu, Flye, miniasm, NextDenovo and Redbean) and five hybrid assemblers that combine short and long reads (HASLR, MaSuRCA, Platanus-allee, SPAdes and WENGAN) and proposed a concrete guideline for the construction of haplotype representation according to the degree of heterozygosity, followed by polishing and purging haplotigs, using stable and high-performance assemblers: Redbean, Flye and MaSuRCA.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10555665PMC
http://dx.doi.org/10.1093/bib/bbad337DOI Listing

Publication Analysis

Top Keywords

long reads
8
heterozygosity
5
assemblers
5
practical assembly
4
assembly guideline
4
guideline genomes
4
genomes levels
4
levels heterozygosity
4
heterozygosity current
4
current long-read
4

Similar Publications

Background: Identification of global transcriptional events is crucial for genome annotation, as accurate annotation enhances the efficiency and comparability of genomic information across species. However, the annotation of transcripts in the cucumber genome remains to be improved, and many transcriptional events have not been well studied.

Results: We collected 1,904 high-quality public cucumber transcriptome samples from the National Center for Biotechnology Information (NCBI) to identify and annotate transcript isoforms in the cucumber genome.

View Article and Find Full Text PDF

Chromosome-level genome assembly of Megachile sculpturalis Smith (Hymenoptera, Apoidea, Megachilidae).

Sci Data

January 2025

Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.

Megachile sculpturalis Smith, 1853 native to East Asia, is an important solitary bee species that has invaded both Europe and the United States. This study provides the first chromosome-level genome assembly of M. sculpturalis using a combination of Nanopore long reads, Illumina short reads, and Hi-C data.

View Article and Find Full Text PDF

Carbene-metal-amide (CMA) complexes have diverse applications in luminescence, imaging and sensing. In this study, we designed and synthesized a series of CMA complexes, which were subsequently doped into a PMMA host. These materials demonstrate light-induced dynamic phosphorescence, attributed to their long intrinsic triplet state lifetime (τP,int, in the μs-ms scale), high intersystem crossing (ISC) rate constant (kISC, up to 107 s-1), and bright phosphorescence.

View Article and Find Full Text PDF

Here, we report the complete genome sequence of a new carlavirus causing mosaic on mint plants in Italy, which we have tentatively named "mint virus C" (MVC). Flexuous particles of around 600 nm were observed using transmission electron microscopy, and next-generation sequencing was performed to determine the nucleotide sequence of the MVC genome, which was found to be 8558 nt long, excluding the poly(A) tail, and shows the typical organization of a carlavirus. The putative proteins encoded by MVC are 44-56% identical to the closest matches in the NCBI database, suggesting that MVC should be considered a member of a new species in the genus Carlavirus.

View Article and Find Full Text PDF

Acoustic Characteristics of Voice and Speech in Post-COVID-19.

Healthcare (Basel)

January 2025

Department of Computer Science, Institute of Mathematics and Statistics, University of São Paulo (USP), São Paulo 05508-220, SP, Brazil.

Background/objectives: The aim of this paper was to compare voice and speech characteristics between post-COVID-19 and control subjects. The hypothesis was that acoustic parameters of voice and speech may differentiate subjects infected by COVID-19 from control subjects. Additionally, we expected to observe the persistence of symptoms in women.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!