MICROPHERRET: MICRObial PHEnotypic tRait ClassifieR using Machine lEarning Techniques.

Edoardo Bizzotto Sofia Fraulini Guido Zampieri Esteban Orellana Laura Treu Stefano Campanaro

Environ Microbiome

Department of Biology, University of Padova, Padova, 35131, Italy.

Published: August 2024

Background: In recent years, there has been a rapid increase in the number of microbial genomes reconstructed through shotgun sequencing, and obtained by newly developed approaches including metagenomic binning and single-cell sequencing. However, our ability to functionally characterize these genomes by experimental assays is orders of magnitude less efficient. Consequently, there is a pressing need for the development of swift and automated strategies for the functional classification of microbial genomes.

Results: The present work leverages a suite of supervised machine learning algorithms to establish a range of 86 metabolic and other ecological functions, such as methanotrophy and plastic degradation, starting from widely obtainable microbial genome annotations. Tests performed on independent datasets demonstrated robust performance across complete, fragmented, and incomplete genomes above a 70% completeness level for most of the considered functions. Application of the algorithms to the Biogas Microbiome database yielded predictions broadly consistent with current biological knowledge and correctly detecting functionally-related nuances of archaeal genomes. Finally, a case study focused on acetoclastic methanogenesis demonstrated how the developed machine learning models can be refined or expanded with models describing novel functions of interest.

Conclusions: The resulting tool, MICROPHERRET, incorporates a total of 86 models, one for each tested functional class, and can be applied to high-quality microbial genomes as well as to low-quality genomes derived from metagenomics and single-cell sequencing. MICROPHERRET can thus aid in understanding the functional role of newly generated genomes within their micro-ecological context.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11308548	PMC
http://dx.doi.org/10.1186/s40793-024-00600-6	DOI Listing

Publication Analysis

Top Keywords

machine learning

microbial genomes

single-cell sequencing

genomes

micropherret microbial

microbial phenotypic

phenotypic trait

trait classifier

classifier machine

learning techniques

Similar Publications

Deep learning-based design and experimental validation of a medicine-like human antibody library.

Brief Bioinform

November 2024

Biotherapeutics Molecule Discovery, Boehringer Ingelheim Pharmaceutical Inc., 900 Ridgebury Road, Ridgefield, CT 06877, United States.

Nandhini Rajagopal Udit Choudhary Kenny Tsang Kyle P Martin Murat Karadag

Antibody generation requires the use of one or more time-consuming methods, namely animal immunization, and in vitro display technologies. However, the recent availability of large amounts of antibody sequence and structural data in the public domain along with the advent of generative deep learning algorithms raises the possibility of computationally generating novel antibody sequences with desirable developability attributes. Here, we describe a deep learning model for computationally generating libraries of highly human antibody variable regions whose intrinsic physicochemical properties resemble those of the variable regions of the marketed antibody-based biotherapeutics (medicine-likeness).

View Article and Find Full Text PDF

Similar Publications

External validation of the SORG machine learning for 90-day and 1-year mortality in patients suffering from extremity metastatic disease in an European cohort of 174 patients.

Acta Orthop Belg

September 2024

T M de Groot A A Sommerkamp Q C B S Thio A V Karhade O Q Groot

Accurate survival prediction of patients with long-bone metastases is challenging, but important for optimizing treatment. The Skeletal Oncology Research Group (SORG) machine learning algorithm (MLA) has been previously developed and internally validated to predict 90-day and 1-year survival. External validation showed promise in the United States and Taiwan.

View Article and Find Full Text PDF

Similar Publications

Evaluation of an Interdisciplinary Educational Program to Foster Learning Health Systems: Education Evaluation.

JMIR Med Educ

January 2025

Centre for Digital Transformation of Health, University of Melbourne, Carlton, Australia.

Sathana Dushyanthen Nadia Izzati Zamri Wendy Chapman Daniel Capurro Kayley Lyons

Background: Learning health systems (LHS) have the potential to use health data in real time through rapid and continuous cycles of data interrogation, implementing insights to practice, feedback, and practice change. However, there is a lack of an appropriately skilled interprofessional informatics workforce that can leverage knowledge to design innovative solutions. Therefore, there is a need to develop tailored professional development training in digital health, to foster skilled interprofessional learning communities in the health care workforce in Australia.

View Article and Find Full Text PDF

Similar Publications

Detecting anomalies in smart wearables for hypertension: a deep learning mechanism.

Front Public Health

January 2025

Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan, Saudi Arabia.

C Kishor Kumar Reddy Vijaya Sindhoori Kaza R Madana Mohana Mohammed Alhameed Fathe Jeribi

Introduction: The growing demand for real-time, affordable, and accessible healthcare has underscored the need for advanced technologies that can provide timely health monitoring. One such area is predicting arterial blood pressure (BP) using non-invasive methods, which is crucial for managing cardiovascular diseases. This research aims to address the limitations of current healthcare systems, particularly in remote areas, by leveraging deep learning techniques in Smart Health Monitoring (SHM).

View Article and Find Full Text PDF

Similar Publications

Evaluating accuracy and reproducibility of large language model performance on critical care assessments in pharmacy education.

Front Artif Intell

January 2025

Department of Clinical and Administrative Pharmacy, University of Georgia College of Pharmacy, Augusta, GA, United States.

Huibo Yang Mengxuan Hu Amoreena Most W Anthony Hawkins Brian Murray

Background: Large language models (LLMs) have demonstrated impressive performance on medical licensing and diagnosis-related exams. However, comparative evaluations to optimize LLM performance and ability in the domain of comprehensive medication management (CMM) are lacking. The purpose of this evaluation was to test various LLMs performance optimization strategies and performance on critical care pharmacotherapy questions used in the assessment of Doctor of Pharmacy students.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!