Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli, giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task to understand key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created, a curation process is performed, eliminating TE fragments and false positives and then annotated in the genome using the homology method. However, the curation process can take weeks, requires extensive manual work and the execution of multiple time-consuming bioinformatics software. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score () in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9521825 | PMC |
http://dx.doi.org/10.1515/jib-2021-0036 | DOI Listing |
Microb Pathog
January 2025
Cell Biology and Molecular Genetics, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, INDIA. Electronic address:
Fungal hybrids arise through the interbreeding of distinct species. This hybridization process fosters increased genetic diversity and the emergence of new traits. Mechanisms driving hybridization include the loss of heterozygosity, copy number variations, and horizontal gene transfer.
View Article and Find Full Text PDFUnlabelled: is one of the three most frequently mutated genes in age-related clonal hematopoiesis (CH), alongside and . CH can progress to myeloid malignancies including chronic monomyelocytic leukemia (CMML), and is also strongly associated with inflammatory cardiovascular disease and all-cause mortality in humans. DNMT3A and TET2 regulate DNA methylation and demethylation pathways respectively, and loss-of-function mutations in these genes reduce DNA methylation in heterochromatin, allowing de-repression of silenced elements in heterochromatin.
View Article and Find Full Text PDFInt J Biol Macromol
January 2025
State Key Laboratory of North China Crop Improvement and Regulation, Hebei Agricultural University, Baoding 071000, China; Key Laboratory of Vegetable Germplasm Innovation and Utilization of Hebei, Ministry of Education of China-Hebei Province Joint Innovation Center for Efficient Green Vegetable Industry, College of Horticulture, Hebei Agricultural University, Baoding 071000, China; Division of Plant Sciences, Research School of Biology, Australian National University, Canberra, ACT 2601, Australia. Electronic address:
Fusarium oxysporum f. sp. lycopersici (Fol), the causal agent of tomato wilt disease, is a soil-borne, vascular-colonizing fungal pathogen that severely impacts tomato production in most growing regions worldwide.
View Article and Find Full Text PDFPlants (Basel)
January 2025
Michael Smith Laboratories, University of British Columbia, 2185 East Mall, Vancouver, BC V6T 1Z4, Canada.
Stinging nettles () have a long history of association with human civilization, having been used as a source of textile fibers, food and medicine. Here, we present a chromosome-level, phased genome assembly for a diploid female clone of from Romania. Using a combination of PacBio HiFi, Oxford Nanopore, and Illumina sequencing, as well as Hi-C long-range interaction data (using a novel Hi-C protocol presented here), we assembled two haplotypes of 574.
View Article and Find Full Text PDFGenes Dev
January 2025
Institute for Research on Cancer and Aging of Nice (IRCAN), Institut National de la Santé et de la Recherche Médicale (INSERM), Centre National de la Recherche Scientifique (CNRS), University Cote d'Azur, Nice 06107, France
Long interspersed element-1 (LINE-1) retrotransposons are abundant transposable elements in mammals and significantly influence chromosome structure, chromatin organization, and 3D genome architecture. In this issue of , Ataei et al. (doi:10.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!