Background: Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations.
Results: We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species.
Conclusions: The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6913007 | PMC |
http://dx.doi.org/10.1186/s13059-019-1905-y | DOI Listing |
Bone
January 2025
ARTORG Center for Biomedical Engineering Research, University of Bern, Bern, Switzerland.
Nonlinear homogenised finite element (hFE) models can accurately predict stiffness and strength of ultra-distal sections of the radius and tibia using in vivo HR-pQCT images. Recent findings showed good stiffness prediction at these distal sections but a limited ability to reproduce experimental strain localisation. The coarseness of voxel-based meshes reduces the computational effort at the cost of heavily simplifying the underlying geometry of the cortex, the gradient of material properties, and the resulting strain distribution.
View Article and Find Full Text PDFTalanta
January 2025
School of Life Sciences, Faculty of Science, The University of Technology Sydney, Sydney, NSW, Australia.
Metabolomics analyses enable the examination and identification of endogenous biochemical reaction products, revealing information on the metabolic pathways and processes active within a living cell or organism. Determination of metabolic shifts can provide important information on a treatment or disease. Unlike other omics fields that typically have analytes of the same chemical class with common building blocks, those that fall under the nomenclature of metabolites encompass a wide array of different compounds with very diverse physiochemical properties.
View Article and Find Full Text PDFBioinformatics
January 2025
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom.
Summary: In recent years there has been a surge in prokaryotic genome assemblies, coming from both isolated organisms and environmental samples. These assemblies often include novel species that are poorly represented in reference databases creating a need for a tool that can annotate both well-described and novel taxa, and can run at scale. Here, we present mettannotator-a comprehensive, scalable Nextflow pipeline for prokaryotic genome annotation that identifies coding and non-coding regions, predicts protein functions, including antimicrobial resistance, and delineates gene clusters.
View Article and Find Full Text PDFFront Plant Sci
January 2025
Guangdong University of Technology, Guangzhou Higher Education Mega Centre, Guangzhou, China.
Precise segmentation of unmanned aerial vehicle (UAV)-captured images plays a vital role in tasks such as crop yield estimation and plant health assessment in banana plantations. By identifying and classifying planted areas, crop areas can be calculated, which is indispensable for accurate yield predictions. However, segmenting banana plantation scenes requires a substantial amount of annotated data, and manual labeling of these images is both timeconsuming and labor-intensive, limiting the development of large-scale datasets.
View Article and Find Full Text PDFJ Pathol
January 2025
The Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland, Australia.
Spatial transcriptomics (ST) offers enormous potential to decipher the biological and pathological heterogeneity in precious archival cancer tissues. Traditionally, these tissues have rarely been used and only examined at a low throughput, most commonly by histopathological staining. ST adds thousands of times as many molecular features to histopathological images, but critical technical issues and limitations require more assessment of how ST performs on fixed archival tissues.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!