Motivation: Transposable Elements (TEs) or jumping genes are DNA sequences that have an intrinsic capability to move within a host genome from one genomic location to another. Studies show that the presence of a TE within or adjacent to a functional gene may alter its expression. TEs can also cause an increase in the rate of mutation and can even mediate duplications and large insertions and deletions in the genome, promoting gross genetic rearrangements. The proper classification of identified jumping genes is important for analyzing their genetic and evolutionary effects. An effective classifier, which can explain the role of TEs in germline and somatic evolution more accurately, is needed. In this study, we examine the performance of a variety of machine learning (ML) techniques and propose a robust method, ClassifyTE, for the hierarchical classification of TEs with high accuracy, using a stacking-based ML method.
Results: We propose a stacking-based approach for the hierarchical classification of TEs. When trained on three different benchmark datasets, our proposed system achieved 4%, 10.68% and 10.13% average percentage improvement (using the hF measure) compared to several state-of-the-art methods. We developed an end-to-end automated hierarchical classification tool based on the proposed approach, ClassifyTE, to classify TEs up to the super-family level. We further evaluated our method on a new TE library generated by a homology-based classification method and found relatively high concordance at higher taxonomic levels. Thus, ClassifyTE paves the way for a more accurate analysis of the role of TEs.
Availability And Implementation: The source code and data are available at https://github.com/manisa/ClassifyTE.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/bioinformatics/btab146 | DOI Listing |
CAZymes ( C arbohydrate A ctive En Zymes ) degrade, synthesize, and modify all complex carbohydrates on Earth. CAZymes are extremely important to research in human health, nutrition, gut microbiome, bioenergy, plant disease, and global carbon recycling. Current CAZyme annotation tools are all based on sequence similarity.
View Article and Find Full Text PDFPersonal Ment Health
February 2025
University of Houston, Houston, Texas, USA.
More work is needed to establish the validity of the Alternative Model of Personality Disorders (AMPD) in the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Acceptance of the AMPD as the primary model of personality disorder requires identifying neurocognitive validators of AMPD-defined personality functioning and demonstrating superiority of the AMPD over the traditional categorical model of personality disorder. It is also important to establish the utility of the AMPD in a developmental context given evidence that personality disorder emerges in adolescence.
View Article and Find Full Text PDFAssist Technol
January 2025
Shaanxi Key Laboratory of Behavior and Cognitive Neuroscience, School of Psychology, Shaanxi Normal University, Xi'an, China.
Socially assistive robots (SARs) are increasingly recognized for their potential in helping older adults age in place. Effectively meeting the diverse needs of older adults requires a proper classification of SARs' functions. However, existing function categories are primarily proposed from the perspective of researchers, rarely from older adults themselves.
View Article and Find Full Text PDFFront Neuroinform
December 2024
Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy.
Introduction: Modeling multi-channel electroencephalographic (EEG) time-series is a challenging tasks, even for the most recent deep learning approaches. Particularly, in this work, we targeted our efforts to the high-fidelity reconstruction of this type of data, as this is of key relevance for several applications such as classification, anomaly detection, automatic labeling, and brain-computer interfaces.
Methods: We analyzed the most recent works finding that high-fidelity reconstruction is seriously challenged by the complex dynamics of the EEG signals and the large inter-subject variability.
Heliyon
December 2024
Xinxiang Medical University, Xinxiang, 453000, China.
This study proposes a public opinion monitoring model that combines the K-means clustering algorithm with Particle Swarm Optimization (PSO) to enhance the accuracy and effectiveness of public opinion monitoring on social media. The model's performance across various dissemination indicators is studied in detail. Through experiments conducted on social media datasets, the study comprehensively evaluates the model from four dimensions: dissemination speed, scope, depth, and sentiment dissemination effectiveness.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!