Wavelet scattering networks for atomistic systems with extrapolation of material properties.

J Chem Phys

Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824-1226, USA.

Published: August 2020

The dream of machine learning in materials science is for a model to learn the underlying physics of an atomic system, allowing it to move beyond the interpolation of the training set to the prediction of properties that were not present in the original training data. In addition to advances in machine learning architectures and training techniques, achieving this ambitious goal requires a method to convert a 3D atomic system into a feature representation that preserves rotational and translational symmetries, smoothness under small perturbations, and invariance under re-ordering. The atomic orbital wavelet scattering transform preserves these symmetries by construction and has achieved great success as a featurization method for machine learning energy prediction. Both in small molecules and in the bulk amorphous LiSi system, machine learning models using wavelet scattering coefficients as features have demonstrated a comparable accuracy to density functional theory at a small fraction of the computational cost. In this work, we test the generalizability of our LiSi energy predictor to properties that were not included in the training set, such as elastic constants and migration barriers. We demonstrate that statistical feature selection methods can reduce over-fitting and lead to remarkable accuracy in these extrapolation tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1063/5.0016020DOI Listing

Publication Analysis

Top Keywords

machine learning
16
wavelet scattering
12
atomic system
8
training set
8
scattering networks
4
networks atomistic
4
atomistic systems
4
systems extrapolation
4
extrapolation material
4
material properties
4

Similar Publications

Exploring the role of oxidative stress in carotid atherosclerosis: insights from transcriptomic data and single-cell sequencing combined with machine learning.

Biol Direct

January 2025

National Key Laboratory for Innovation and Transformation of Luobing Theory; The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese National Health Commission and Chinese Academy of Medical Sciences, Jinan, China.

Background: Carotid atherosclerotic plaque is the primary cause of cardiovascular and cerebrovascular diseases. It is closely related to oxidative stress and immune inflammation. This bioinformatic study was conducted to identify key oxidative stress-related genes and key immune cell infiltration involved in the formation, progression, and stabilization of plaques and investigate the relationship between them.

View Article and Find Full Text PDF

Unveiling new therapeutic horizons in rheumatoid arthritis: an In-depth exploration of circular RNAs derived from plasma exosomes.

J Orthop Surg Res

January 2025

Department of Rheumatology and Immunology, Affiliated Hospital of Yangzhou University, Yangzhou University, No. 368 Hanjiang Middle Road, Yangzhou, Jiangsu, 225000, China.

Rheumatoid arthritis (RA), a chronic inflammatory joint disease causing permanent disability, involves exosomes, nanosized mammalian extracellular particles. Circular RNA (circRNA) serves as a biomarker in RA blood samples. This research screened differentially expressed circRNAs in RA patient plasma exosomes for novel diagnostic biomarkers.

View Article and Find Full Text PDF

AiGPro: a multi-tasks model for profiling of GPCRs for agonist and antagonist.

J Cheminform

January 2025

School of Systems Biomedical Science, Soongsil University, 369 Sangdo-ro, Dongjak-gu, 06978, Seoul, Republic of Korea.

G protein-coupled receptors (GPCRs) play vital roles in various physiological processes, making them attractive drug discovery targets. Meanwhile, deep learning techniques have revolutionized drug discovery by facilitating efficient tools for expediting the identification and optimization of ligands. However, existing models for the GPCRs often focus on single-target or a small subset of GPCRs or employ binary classification, constraining their applicability for high throughput virtual screening.

View Article and Find Full Text PDF

Detection of early relapse in multiple myeloma patients.

Cell Div

January 2025

Babak Myeloma Group, Department of Pathophysiology, Faculty of Medicine, Masaryk University, Brno, Czech Republic.

Background: Multiple myeloma (MM) represents the second most common hematological malignancy characterized by the infiltration of the bone marrow by plasma cells that produce monoclonal immunoglobulin. While the quality and length of life of MM patients have significantly increased, MM remains a hard-to-treat disease; almost all patients relapse. As MM is highly heterogenous, patients relapse at different times.

View Article and Find Full Text PDF

Background: Amebiasis represents a significant global health concern. This is especially evident in developing countries, where infections are more common. The primary diagnostic method in laboratories involves the microscopy of stool samples.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!