Deepmol: an automated machine and deep learning framework for computational chemistry.

J Cheminform

CEB - Centre of Biological Engineering, University of Minho, Braga, Portugal.

Published: December 2024

AI Article Synopsis

  • The introduction of Machine Learning technologies has transformed computational chemistry, but challenges like algorithm selection and data pre-processing remain.
  • DeepMol addresses these issues as a pioneering AutoML tool that automates crucial steps in the ML pipeline, effectively optimizing methods for predicting molecular properties.
  • With competitive performance on benchmark datasets and robust features such as open-source code, comprehensive documentation, and support for various models, DeepMol establishes itself as a leading tool in the computational chemistry domain.

Article Abstract

The domain of computational chemistry has experienced a significant evolution due to the introduction of Machine Learning (ML) technologies. Despite its potential to revolutionize the field, researchers are often encumbered by obstacles, such as the complexity of selecting optimal algorithms, the automation of data pre-processing steps, the necessity for adaptive feature engineering, and the assurance of model performance consistency across different datasets. Addressing these issues head-on, DeepMol stands out as an Automated ML (AutoML) tool by automating critical steps of the ML pipeline. DeepMol rapidly and automatically identifies the most effective data representation, pre-processing methods and model configurations for a specific molecular property/activity prediction problem. On 22 benchmark datasets, DeepMol obtained competitive pipelines compared with those requiring time-consuming feature engineering, model design and selection processes. As one of the first AutoML tools specifically developed for the computational chemistry domain, DeepMol stands out with its open-source code, in-depth tutorials, detailed documentation, and examples of real-world applications, all available at https://github.com/BioSystemsUM/DeepMol and https://deepmol.readthedocs.io/en/latest/ . By introducing AutoML as a groundbreaking feature in computational chemistry, DeepMol establishes itself as the pioneering state-of-the-art tool in the field.Scientific contributionDeepMol aims to provide an integrated framework of AutoML for computational chemistry. DeepMol provides a more robust alternative to other tools with its integrated pipeline serialization, enabling seamless deployment using the fit, transform, and predict paradigms. It uniquely supports both conventional and deep learning models for regression, classification and multi-task, offering unmatched flexibility compared to other AutoML tools. DeepMol's predefined configurations and customizable objective functions make it accessible to users at all skill levels while enabling efficient and reproducible workflows. Benchmarking on diverse datasets demonstrated its ability to deliver optimized pipelines and superior performance across various molecular machine-learning tasks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622685PMC
http://dx.doi.org/10.1186/s13321-024-00937-7DOI Listing

Publication Analysis

Top Keywords

computational chemistry
20
deep learning
8
chemistry domain
8
feature engineering
8
deepmol stands
8
automl tools
8
chemistry deepmol
8
deepmol
7
computational
5
chemistry
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!