DeePaC: predicting pathogenic potential of novel DNA with reverse-complement neural networks.

Bioinformatics

Bioinformatics Unit (MF1), Department of Methodology and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany.

Published: January 2020

Motivation: We expect novel pathogens to arise due to their fast-paced evolution, and new species to be discovered thanks to advances in DNA sequencing and metagenomics. Moreover, recent developments in synthetic biology raise concerns that some strains of bacteria could be modified for malicious purposes. Traditional approaches to open-view pathogen detection depend on databases of known organisms, which limits their performance on unknown, unrecognized and unmapped sequences. In contrast, machine learning methods can infer pathogenic phenotypes from single NGS reads, even though the biological context is unavailable.

Results: We present DeePaC, a Deep Learning Approach to Pathogenicity Classification. It includes a flexible framework allowing easy evaluation of neural architectures with reverse-complement parameter sharing. We show that convolutional neural networks and LSTMs outperform the state-of-the-art based on both sequence homology and machine learning. Combining a deep learning approach with integrating the predictions for both mates in a read pair results in cutting the error rate almost in half in comparison to the previous state-of-the-art.

Availability And Implementation: The code and the models are available at: https://gitlab.com/rki_bioinformatics/DeePaC.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btz541DOI Listing

Publication Analysis

Top Keywords

neural networks
8
machine learning
8
deep learning
8
learning approach
8
deepac predicting
4
predicting pathogenic
4
pathogenic potential
4
potential novel
4
novel dna
4
dna reverse-complement
4

Similar Publications

CDCG-UNet: Chaotic Optimization Assisted Brain Tumor Segmentation Based on Dilated Channel Gate Attention U-Net Model.

Neuroinformatics

January 2025

Department of Information Technology, Faculty of Engineering and Technology, SRM Institute of Science and Technology, Ramapuram, Chennai, 600089, India.

Brain tumours are one of the most deadly and noticeable types of cancer, affecting both children and adults. One of the major drawbacks in brain tumour identification is the late diagnosis and high cost of brain tumour-detecting devices. Most existing approaches use ML algorithms to address problems, but they have drawbacks such as low accuracy, high loss, and high computing cost.

View Article and Find Full Text PDF

Exploiting biomimetic perception of invisible spectra in flexible artificial human vision systems (HVSs) is crucial for real-time dynamic information processing. Nevertheless, the fast processing of motion objects in natural environments poses a challenge, necessitating that these artificial HVSs simultaneously have swift photoresponse and nonvolatile memory. Here, inspired by the human retina, we propose a flexible UV neuromorphic visual synaptic device (NeuVSD) based on GaO@GaN-composited nanowires for dynamic visual perception.

View Article and Find Full Text PDF

Gait Video-Based Prediction of Severity of Cerebellar Ataxia Using Deep Neural Networks.

Mov Disord

January 2025

Department of Neurology, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, Sapporo, Japan.

Background: Pose estimation algorithms applied to two-dimensional videos evaluate gait disturbances; however, a few studies have used this method to evaluate ataxic gait.

Objective: The aim was to assess whether a pose estimation algorithm can predict the severity of cerebellar ataxia by applying it to gait videos.

Methods: We video-recorded 66 patients with degenerative cerebellar diseases performing the timed up-and-go test.

View Article and Find Full Text PDF

AggNet: Advancing protein aggregation analysis through deep learning and protein language model.

Protein Sci

February 2025

Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.

Protein aggregation is critical to various biological and pathological processes. Besides, it is also an important property in biotherapeutic development. However, experimental methods to profile protein aggregation are costly and labor-intensive, driving the need for more efficient computational alternatives.

View Article and Find Full Text PDF

In the realm of artificial intelligence-driven drug discovery (AIDD), accurately predicting the influence of molecular structures on their properties is a critical research focus. While deep learning models based on graph neural networks (GNNs) have made significant advancements in this area, prior studies have primarily concentrated on molecule-level representations, often neglecting the impact of functional group structures and the potential relationships between fragments on molecular property predictions. To address this gap, we introduce the multi-scale feature attention graph neural network (MfGNN), which enhances traditional atom-based molecular graph representations by incorporating fragment-level representations derived from chemically synthesizable BRICS fragments.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!