DL-TODA: A Deep Learning Tool for Omics Data Analysis.

Biomolecules

Department of Cell and Molecular Biology, College of the Environment and Life Sciences, University of Rhode Island, Kingston, RI 02881, USA.

Published: March 2023

Metagenomics is a technique for genome-wide profiling of microbiomes that generates billions of DNA sequences called reads. Given the proliferation of metagenomic projects, computational tools are needed to classify metagenomic reads efficiently and accurately without requiring the construction of a reference database. The program DL-TODA presented here aims to classify metagenomic reads using a deep learning model trained on over 3000 bacterial species. A convolutional neural network architecture originally designed for computer vision was applied to model species-specific features. Using synthetic testing data simulated with 2454 genomes from 639 species, DL-TODA was shown to classify nearly 75% of the reads with high confidence. The classification accuracy of DL-TODA was over 0.98 at taxonomic ranks above the genus level, making it comparable with Kraken2 and Centrifuge, two state-of-the-art taxonomic classification tools. DL-TODA also achieved an accuracy of 0.97 at the species level, which is higher than the 0.93 achieved by Kraken2 and the 0.85 achieved by Centrifuge on the same test set. Application of DL-TODA to human oral and cropland soil metagenomes further demonstrated its use in analyzing microbiomes from diverse environments. Compared to Centrifuge and Kraken2, DL-TODA predicted distinct relative abundance rankings and was less biased toward a single taxon.
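
The evaluation quantities mentioned in the abstract (the fraction of reads classified with high confidence, accuracy at the species rank versus a higher rank, and relative abundance) can be illustrated with a minimal Python sketch. It is not the DL-TODA implementation. The species list, the per-read scores, the 0.5 confidence threshold, and the choice to compute accuracy only over confidently classified reads are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy label set; the real model covers over 3000 species.
SPECIES = ["Escherichia coli", "Escherichia fergusonii", "Bacillus subtilis"]
# Genus is taken as the first word of the binomial name (illustrative shortcut).
GENUS_OF = {s: s.split()[0] for s in SPECIES}


def classify(probs, threshold):
    """Return the top-scoring species, or None if its score is below threshold."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return SPECIES[best] if probs[best] >= threshold else None


def evaluate(reads, threshold=0.5):
    """Summarize predictions for reads given as (true_species, scores) pairs.

    The threshold value is an assumption, not a value from the paper.
    Accuracies are computed over confidently classified reads only.
    """
    calls = [(true, classify(probs, threshold)) for true, probs in reads]
    kept = [(true, call) for true, call in calls if call is not None]
    n = len(kept)
    species_acc = sum(t == c for t, c in kept) / n if n else 0.0
    genus_acc = sum(GENUS_OF[t] == GENUS_OF[c] for t, c in kept) / n if n else 0.0
    counts = Counter(c for _, c in kept)
    return {
        "fraction_classified": n / len(reads),
        "species_accuracy": species_acc,
        "genus_accuracy": genus_acc,
        "relative_abundance": {s: k / n for s, k in counts.items()},
    }


# Made-up scores standing in for the per-read output of a classifier.
reads = [
    ("Escherichia coli", [0.90, 0.05, 0.05]),
    ("Escherichia coli", [0.30, 0.60, 0.10]),   # wrong species, right genus
    ("Bacillus subtilis", [0.10, 0.10, 0.80]),
    ("Bacillus subtilis", [0.40, 0.35, 0.25]),  # below threshold: unclassified
]
result = evaluate(reads)
print(result)
```

In this toy data the second read is assigned to the wrong species within the correct genus, which shows why accuracy at higher taxonomic ranks can exceed species-level accuracy.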

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10135919
DOI: http://dx.doi.org/10.3390/biom13040585

Similar Publications

Deep learning-based metabolomics data study of prostate cancer.

BMC Bioinformatics

December 2024

College of Computer Science and Technology, Inner Mongolia Minzu University, Tongliao, 028000, China.

As a heterogeneous disease, prostate cancer (PCa) exhibits diverse clinical and biological features, which pose significant challenges for early diagnosis and treatment. Metabolomics offers promising new approaches for early diagnosis, treatment, and prognosis of PCa. However, metabolomics data are characterized by high dimensionality, noise, variability, and small sample sizes, presenting substantial challenges for classification.

Methods: We retrospectively collected CT scan data from 276 patients with pathologically confirmed primary bone tumors from 4 medical centers in Guangdong Province between January 2010 and August 2021. A convolutional neural network (CNN) was employed as the deep learning architecture. The optimal baseline deep learning model (R-Net) was determined through transfer learning, and an optimized model (S-Net) was obtained through algorithmic improvements.

Predicting craniofacial fibrous dysplasia growth status: an exploratory study of a hybrid radiomics and deep learning model based on computed tomography images.

Oral Surg Oral Med Oral Pathol Oral Radiol

November 2024

Department of Oral and Cranio-Maxillofacial Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; College of Stomatology, Shanghai Jiao Tong University, Shanghai, China; National Center for Stomatology, Shanghai, China; National Clinical Research Center for Oral Diseases, Shanghai, China; Shanghai Key Laboratory of Stomatology, Shanghai, China.

Objective: This study aimed to develop 3 models based on computed tomography (CT) images of patients with craniofacial fibrous dysplasia (CFD): a radiomics model (Model Rad), a deep learning (DL) model (Model DL), and a hybrid radiomics and DL model (Model Rad+DL), and evaluate the ability of these models to distinguish between adolescents with active lesion progression and adults with stable lesion progression.

Methods: We retrospectively analyzed preoperative CT scans from 148 CFD patients treated at Shanghai Ninth People's Hospital. The images were processed using 3D-Slicer software to segment and extract regions of interest for radiomics and DL analysis.

Purpose: To improve the accuracy of one-stage object detection by modifying YOLOv7 with a Convolutional Block Attention Module (CBAM), yielding YOLOv7-CBAM, which can automatically identify torn or intact rotator cuff tendons to assist physicians in diagnosing rotator cuff lesions through ultrasound.

Methods: Between 2020 and 2021, patients who had experienced shoulder pain for over 3 months and had undergone both ultrasound and MRI examinations were categorized into torn and intact groups. To ensure balanced training, we included the same number of patients in both groups.

Microinfarcts and microhemorrhages are characteristic lesions of cerebrovascular disease. Although multiple studies have been published, there is no single universal standard for the neuropathological assessment of cerebrovascular disease. In this study, we propose a novel application of machine learning to the automated screening of microinfarcts and microhemorrhages.
