An open-source framework for fast-yet-accurate calculation of quantum mechanical features.

Phys Chem Chem Phys

Data Science and Modelling, Pharmaceutical Sciences, R & D, AstraZeneca, Gothenburg, Sweden.

Published: May 2022

We present the open-source framework that enables the efficient and robust calculation of quantum mechanical features for atoms and molecules. For a benchmark set of 49 experimental molecular polarizabilities, the predictive power of the presented method competes against second-order perturbation theory in a converged atomic-orbital basis set at a fraction of its computational costs. The calculation of isotropic molecular polarizabilities is robust for a data set of more than 80 000 molecules. We present furthermore a generally applicable van der Waals radius model that is rooted on atomic static polarizabilites. Efficiency tests show that such radii can even be calculated for small- to medium-size proteins where the largest system (SARS-CoV-2 spike protein) has 42 539 atoms. Following the work of Domingo-Alemenara [Domingo-Alemenara , 2019, , 5811], we present computational predictions for retention times for different chromatographic methods and describe how physicochemical features improve the predictive power of machine-learning models that otherwise only rely on two-dimensional features like molecular fingerprints. Additionally, we developed an internal benchmark set of experimental super-critical fluid chromatography retention times. For those methods, improvements of up to 10.6% are obtained when combining molecular fingerprints with physicochemical descriptors. Shapley additive explanation values show furthermore that the physical nature of the applied features can be retained within the final machine-learning models. We generally recommend the framework as a robust, low-cost, and physically motivated featurizer for upcoming state-of-the-art machine-learning studies.

Download full-text PDF

Source
http://dx.doi.org/10.1039/d2cp01165dDOI Listing

Publication Analysis

Top Keywords

open-source framework
8
calculation quantum
8
quantum mechanical
8
mechanical features
8
benchmark set
8
set experimental
8
molecular polarizabilities
8
predictive power
8
retention times
8
machine-learning models
8

Similar Publications

Objective: To systematically assess the current state of speech-language-hearing (SLH) practices in health services addressing vocal care for transgender individuals, aiming to identify key themes and gaps in the existing body of knowledge.

Methods: This scoping review was based on the Joanna Briggs Institute manual and followed the recommendations of the Preferred Reporting Items for Systematic reviews and Meta-Analyses-Extension for Scoping Reviews. It was registered with the Open Science Framework Open Source 10.

View Article and Find Full Text PDF

CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.

PLoS One

January 2025

Observational Health Data Analytics, Janssen Research and Development, LLC, Titusville, NJ, United States of America.

Objective: This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.

Materials And Methods: The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date.

View Article and Find Full Text PDF

Decentralized Finance (DeFi) is reshaping traditional finance by enabling direct transactions without intermediaries, creating a rich source of open financial data. Layer 2 (L2) solutions are emerging to enhance the scalability and efficiency of the DeFi ecosystem, surpassing Layer 1 (L1) systems. However, the impact of L2 solutions is still underexplored, mainly due to the lack of comprehensive transaction data indices for economic analysis.

View Article and Find Full Text PDF

Background: Platform trials are innovative clinical trials governed by a master protocol that allows for the evaluation of multiple investigational treatments that enter and leave the trial over time. Interest in platform trials has been steadily increasing over the last decade. Due to their highly adaptive nature, platform trials provide sufficient flexibility to customize important trial design aspects to the requirements of both the specific disease under investigation and the different stakeholders.

View Article and Find Full Text PDF

Trait-based approaches are revolutionizing our understanding of high-diversity ecosystems by providing insights into the principles underlying key ecological processes, such as community assembly, species distribution, resilience, and the relationship between biodiversity and ecosystem functioning. In 2016, the Coral Trait Database advanced coral reef science by centralizing trait information for stony corals (i.e.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!