Quantum chemical studies of the formation and growth of atmospheric molecular clusters are important for understanding aerosol particle formation. However, the search for the lowest free-energy cluster configuration is extremely time consuming. This makes high-level benchmark data sets extremely valuable in the quest for the global minimum as it allows the identification of cost-efficient computational methodologies, as well as the development of high-level machine learning (ML) models. Herein, we present a highly versatile quantum chemical data set comprising a total of 11 749 (acid)(base) cluster configurations, containing up to 44 atoms. Utilizing the LUMI supercomputer, we calculated highly accurate PNO-CCSD(F12*)(T)/cc-pVDZ-F12 binding energies of the full set of cluster configurations leading to an unprecedented data set both in regard to sheer size and with respect to the level of theory. We employ the constructed benchmark set to assess the performance of various semiempirical and density functional theory methods. In particular, we find that the r-SCAN-3c method shows excellent performance across the data set related to both accuracy and CPU time, making it a promising method to employ during cluster configurational sampling. Furthermore, applying the data sets, we construct ML models based on Δ-learning and provide recommendations for future application of ML in cluster configurational sampling.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1021/acs.jctc.2c00825 | DOI Listing |
Acta Crystallogr D Struct Biol
February 2025
Université Paris-Saclay, Université Evry, IBISC, 91020 Evry-Courcouronnes, France.
Predicting the 3D structure of RNA is a significant challenge despite ongoing advancements in the field. Although AlphaFold has successfully addressed this problem for proteins, RNA structure prediction raises difficulties due to the fundamental differences between proteins and RNA, which hinder its direct adaptation. The latest release of AlphaFold, AlphaFold3, has broadened its scope to include multiple different molecules such as DNA, ligands and RNA.
View Article and Find Full Text PDFSurv Methodol
December 2024
Department of Statistical Science, 214a Old Chemistry Building, Duke University, Durham, NC 27708-0251.
When seeking to release public use files for confidential data, statistical agencies can generate fully synthetic data. We propose an approach for making fully synthetic data from surveys collected with complex sampling designs. Our approach adheres to the general strategy proposed by Rubin (1993).
View Article and Find Full Text PDFWorld J Clin Cases
January 2025
Antequera Hospital, Northern Málaga Antequera Integrated Healthcare Area, Antequera 29200, Málaga, Spain.
Background: Addressing the growing challenge of hospitalizing chronic multimorbid patients, this study examines the strain these conditions impose on healthcare systems at a local level, focusing on a pilot program. Chronic diseases and complex patients require comprehensive management strategies to reduce healthcare burdens and improve patient outcomes. If proven effective, this pilot model has the potential to be replicated in other healthcare settings to enhance the management of chronic multimorbid patients.
View Article and Find Full Text PDFOphthalmol Sci
November 2024
Casey Eye Institute, Oregon Health and Science University, Portland, Oregon.
Purpose: Retinopathy of prematurity (ROP) stage is defined by the visual appearance of the vascular-avascular border, which reflects a spectrum of pathologic neurovascular tissue (NVT). Previous work demonstrated that the thickness of the ridge lesion, measured using OCT, corresponds to higher clinical diagnosis of stage. This study evaluates whether the volume of anomalous NVT (ANVTV), defined as abnormal tissue protruding from the regular contour of the retina, can be measured automatically using deep learning to develop quantitative OCT-based biomarkers in ROP.
View Article and Find Full Text PDFCirculation
January 2025
Johns Hopkins Ciccarone Center for Prevention of Cardiovascular Disease, Baltimore, MD (Z.Y., E.T., Z.A.D., K.K.J., N.O., T.R., E.B., M.J.B.).
Background: Understanding the association of tobacco product use with subclinical markers is essential in evaluating health effects to inform regulatory policy. This is particularly relevant for noncigarette products (eg, cigars, pipes, and smokeless tobacco), which have been understudied because of their low prevalence in individual cohort studies.
Methods: This cross-sectional study included 98 450 participants from the Cross-Cohort Collaboration-Tobacco data set.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!