Genome sequence assemblies provide the basis for our understanding of biology. Generating error-free assemblies is therefore the ultimate, but sadly still unachieved goal of a multitude of research projects. Despite the ever-advancing improvements in data generation, assembly algorithms and pipelines, no automated approach has so far reliably generated near error-free genome assemblies for eukaryotes. Whilst working towards improved datasets and fully automated pipelines, assembly evaluation and curation is actively used to bridge this shortcoming and significantly reduce the number of assembly errors. In addition to this increase in product value, the insights gained from assembly curation are fed back into the automated assembly strategy and contribute to notable improvements in genome assembly quality. We describe our tried and tested approach for assembly curation using gEVAL, the genome evaluation browser. We outline the procedures applied to genome curation using gEVAL and also our recommendations for assembly curation in a gEVAL-independent context to facilitate the uptake of genome curation in the wider community.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7794651PMC
http://dx.doi.org/10.1093/gigascience/giaa153DOI Listing

Publication Analysis

Top Keywords

assembly curation
12
genome assemblies
8
assembly
8
curation geval
8
genome curation
8
genome
7
curation
7
improving quality
4
quality genome
4
assemblies
4

Similar Publications

Microbial research generates vast and complex data from diverse omics technologies, necessitating innovative analytical solutions. microGalaxy (Galaxy for Microbiology) addresses these needs with a user-friendly platform that integrates 220+ tool suites and 65+ curated workflows for microbial analyses, including taxonomic profiling, assembly, annotation, and functional analysis. Hosted on the main EU Galaxy server (microgalaxy.

View Article and Find Full Text PDF

Background: The advent of next generation sequencing technologies has enabled a surge in the number of whole genome sequences in public databases, and our understanding of the composition and evolution of bacterial genomes. Besides model organisms and pathogens, some attention has been dedicated to industrial bacteria, notably members of the Lactobacillaceae family that are commonly studied and formulated as probiotic bacteria. Of particular interest is Lactobacillus acidophilus NCFM, an extensively studied strain that has been widely commercialized for decades and is being used for the delivery of vaccines and therapeutics.

View Article and Find Full Text PDF

Background: Enhanced biological phosphorus removal (EBPR) systems utilize phosphorus-accumulating organisms (PAOs) to remove phosphorus from wastewater since excessive phosphorus in water bodies can lead to eutrophication. This study aimed to characterize a newly isolated PAO strain for its potential application in EBPR systems and to screen for additional biotechnological potential. Here, sequencing allowed for genomic analysis, identifying the genes and molecules involved, and exploring other potentials.

View Article and Find Full Text PDF

Satellite DNAs (satDNAs) are tandemly repeated sequences that make up a significant portion of almost all eukaryotic genomes. Although satDNAs have been shown to play an important role in genome organization and evolution, they are relatively poorly analyzed, even in model organisms. One of the main reasons for the current lack of in-depth studies on satDNAs is their underrepresentation in genome assemblies.

View Article and Find Full Text PDF

Background: Diaphorina citri is an insect vector of "Candidatus Liberibacter asiaticus" (CLas), the gram-negative bacterial pathogen associated with citrus greening disease. Control measures rely on pesticides with negative impacts on the environment, natural ecosystems, and human and animal health. In contrast, gene-targeting methods have the potential to specifically target the vector species and/or reduce pathogen transmission.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!