Data-driven synthesis planning has seen remarkable successes in recent years by virtue of modern approaches of artificial intelligence that efficiently exploit vast databases with experimental data on chemical reactions. However, this success story is intimately connected to the availability of existing experimental data. It may well occur in retrosynthetic and synthesis design tasks that predictions in individual steps of a reaction cascade are affected by large uncertainties. In such cases, it will, in general, not be easily possible to provide missing data from autonomously conducted experiments on demand. However, first-principles calculations can, in principle, provide missing data to enhance the confidence of an individual prediction or for model retraining. Here, we demonstrate the feasibility of such an ansatz and examine resource requirements for conducting autonomous first-principles calculations on demand.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10259370PMC
http://dx.doi.org/10.1039/d3dd00006kDOI Listing

Publication Analysis

Top Keywords

experimental data
8
provide missing
8
missing data
8
first-principles calculations
8
data
5
quantum chemical
4
chemical data
4
data generation
4
generation fill-in
4
fill-in reliability
4

Similar Publications

The microenvironment cell index is a novel indicator for the prognosis and therapeutic regimen selection of cancers.

J Transl Med

January 2025

Department of Stem Cell and Regenerative Medicine, Southwest Cancer Center, Southwest Hospital, Third Military Medical University (Army Medical University), Chongqing, 400038, China.

Background: It is worthwhile to establish a prognostic prediction model based on microenvironment cells (MCs) infiltration and explore new treatment strategies for triple-negative breast cancer (TNBC).

Methods: The xCell algorithm was used to quantify the cellular components of the TNBC microenvironment based on bulk RNA sequencing (bulk RNA-seq) data. The MCs index (MCI) was constructed using the least absolute shrinkage and selection operator Cox (LASSO-Cox) regression analysis.

View Article and Find Full Text PDF

During public health crises such as pandemics, governments must rapidly adopt and implement wide-reaching policies and programs ("public policy interventions"). A key takeaway from the coronavirus disease 2019 (COVID-19) pandemic was that although numerous randomized controlled trials (RCTs) focussed on drugs and vaccines, few policy experiments were conducted to evaluate effects of public policy interventions across various sectors on viral transmission and other consequences. Moreover, many quasi-experimental studies were of spurious quality, thus proving unhelpful for informing public policy.

View Article and Find Full Text PDF

Background: Lung cancer is the first cause of cancer-related death. Awake lung resection is a new frontier of the concept of minimally invasive surgery. Our purpose is to demonstrate the feasibility of this technique for lobar and sublobar lung resection in NSCLC patients.

View Article and Find Full Text PDF

Current strategies centred on either merging or linking initial hits from fragment-based drug design (FBDD) crystallographic screens generally do not fully leaverage 3D structural information. We show that an algorithmic approach (Fragmenstein) that 'stitches' the ligand atoms from this structural information together can provide more accurate and reliable predictions for protein-ligand complex conformation than general methods such as pharmacophore-constrained docking. This approach works under the assumption of conserved binding: when a larger molecule is designed containing the initial fragment hit, the common substructure between the two will adopt the same binding mode.

View Article and Find Full Text PDF

Background: Human kinesin family member 11 (KIF11) plays a vital role in regulating the cell cycle and is implicated in the tumorigenesis and progression of various cancers, but its role in endometrial cancer (EC) is still unclear. Our current research explored the prognostic value, biological function and targeting strategy of KIF11 in EC through approaches including bioinformatics, machine learning and experimental studies.

Methods: The GSE17025 dataset from the GEO database was analyzed via the limma package to identify differentially expressed genes (DEGs) in EC.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!