Big data has ushered in a new wave of predictive power using machine-learning models. In this work, we assess what big means in the context of typical materials-science machine-learning problems. This concerns not only data volume, but also data quality and veracity as much as infrastructure issues.
View Article and Find Full Text PDFActive matter spans a wide range of time and length scales, from groups of cells and synthetic self-propelled colloids to schools of fish and flocks of birds. The theoretical framework describing these systems has shown tremendous success in finding universal phenomenology. However, further progress is often burdened by the difficulty of determining forces controlling the dynamics of individual elements within each system.
View Article and Find Full Text PDFAccurate and explainable artificial-intelligence (AI) models are promising tools for accelerating the discovery of new materials. Recently, symbolic regression has become an increasingly popular tool for explainable AI because it yields models that are relatively simple analytical descriptions of target properties. Due to its deterministic nature, the sure-independence screening and sparsifying operator (SISSO) method is a particularly promising approach for this application.
View Article and Find Full Text PDFThe expansive production of data in materials science, their widespread sharing and repurposing requires educated support and stewardship. In order to ensure that this need helps rather than hinders scientific work, the implementation of the FAIR-data principles () must not be too narrow. Besides, the wider materials-science community ought to agree on the strategies to tackle the challenges that are specific to its data, both from computations and experiments.
View Article and Find Full Text PDFA reliable description of surfaces structures in a reactive environment is crucial to understand materials' functions. We present a first-principles theory of replica-exchange grand-canonical-ensemble molecular dynamics and apply it to evaluate phase equilibria of surfaces in a reactive gas-phase environment. We identify the different surface phases and locate phase boundaries including triple and critical points.
View Article and Find Full Text PDFThe design of heterogeneous catalysts is challenged by the complexity of materials and processes that govern reactivity and by the fact that the number of good catalysts is very small in comparison to the number of possible materials. Here, we show how the subgroup-discovery (SGD) artificial-intelligence approach can be applied to an experimental plus theoretical data set to identify constraints on key physicochemical parameters, the so-called SG , which exclusively describe materials and reaction conditions with outstanding catalytic performance. By using high-throughput experimentation, 120 SiO-supported catalysts containing ruthenium, tungsten, and phosphorus were synthesized and tested in the catalytic oxidation of propylene.
View Article and Find Full Text PDFUnlabelled: In order to estimate the reactivity of a large number of potentially complex heterogeneous catalysts while searching for novel and more efficient materials, physical as well as data-centric models have been developed for a faster evaluation of adsorption energies compared to first-principles calculations. However, models designed to describe as many materials as possible might overlook the very few compounds that have the appropriate adsorption properties to be suitable for a given catalytic process. Here, the subgroup-discovery (SGD) artificial-intelligence approach is used to identify the key descriptive parameters and constrains on their values, the so-called SG rules, which particularly describe transition-metal surfaces with outstanding adsorption properties for the oxygen-reduction and -evolution reactions.
View Article and Find Full Text PDFCatalytic-materials design requires predictive modeling of the interaction between catalyst and reactants. This is challenging due to the complexity and diversity of structure-property relationships across the chemical space. Here, we report a strategy for a rational design of catalytic materials using the artificial intelligence approach (AI) subgroup discovery.
View Article and Find Full Text PDFDue to their ability to recognize complex patterns, neural networks can drive a paradigm shift in the analysis of materials science data. Here, we introduce ARISE, a crystal-structure identification method based on Bayesian deep learning. As a major step forward, ARISE is robust to structural noise and can treat more than 100 crystal structures, a number that can be extended on demand.
View Article and Find Full Text PDFDrug efficacy depends on its capacity to permeate across the cell membrane. We consider the prediction of passive drug-membrane permeability coefficients. Beyond the widely recognized correlation with hydrophobicity, we additionally consider the functional relationship between passive permeation and acidity.
View Article and Find Full Text PDFAbstract: The performance in heterogeneous catalysis is an example of a complex materials function, governed by an intricate interplay of several processes (e.g., the different surface chemical reactions, and the dynamic restructuring of the catalyst material at reaction conditions).
View Article and Find Full Text PDFAlthough machine learning (ML) models promise to substantially accelerate the discovery of novel materials, their performance is often still insufficient to draw reliable conclusions. Improved ML models are therefore actively researched, but their design is currently guided mainly by monitoring the average model test error. This can render different models indistinguishable although their performance differs substantially across materials, or it can make a model appear generally insufficient while it actually works well in specific sub-domains.
View Article and Find Full Text PDFNanostructured materials are essential building blocks for the fabrication of new devices for energy harvesting/storage, sensing, catalysis, magnetic, and optoelectronic applications. However, because of the increase of technological needs, it is essential to identify new functional materials and improve the properties of existing ones. The objective of this Viewpoint is to examine the state of the art of atomic-scale simulative and experimental protocols aimed to the design of novel functional nanostructured materials, and to present new perspectives in the relative fields.
View Article and Find Full Text PDFPredicting the stability of the perovskite structure remains a long-standing challenge for the discovery of new functional materials for many applications including photovoltaics and electrocatalysts. We developed an accurate, physically interpretable, and one-dimensional tolerance factor, τ, that correctly predicts 92% of compounds as perovskite or nonperovskite for an experimental dataset of 576 materials ( = O, F, Cl, Br, I) using a novel data analytics approach based on SISSO (sure independence screening and sparsifying operator). τ is shown to generalize outside the training set for 1034 experimentally realized single and double perovskites (91% accuracy) and is applied to identify 23,314 new double perovskites ( ) ranked by their probability of being stable as perovskite.
View Article and Find Full Text PDFGold nanoclusters have been the focus of numerous computational studies, but an atomistic understanding of their structural and dynamical properties at finite temperature is far from satisfactory. To address this deficiency, we investigate gold nanoclusters via ab initio molecular dynamics, in a range of sizes where a core-shell morphology is observed. We analyze their structure and dynamics using state-of-the-art techniques, including unsupervised machine-learning nonlinear dimensionality reduction (sketch-map) for describing the similarities and differences among the range of sampled configurations.
View Article and Find Full Text PDFComputational methods that automatically extract knowledge from data are critical for enabling data-driven materials science. A reliable identification of lattice symmetry is a crucial first step for materials characterization and analytics. Current methods require a user-specified threshold, and are unable to detect average symmetries for defective structures.
View Article and Find Full Text PDFWe present the implementation of GAtor, a massively parallel, first-principles genetic algorithm (GA) for molecular crystal structure prediction. GAtor is written in Python and currently interfaces with the FHI-aims code to perform local optimizations and energy evaluations using dispersion-inclusive density functional theory (DFT). GAtor offers a variety of fitness evaluation, selection, crossover, and mutation schemes.
View Article and Find Full Text PDFLigand-protected Au clusters are non-bleaching fluorescence markers in bio- and medical applications. Here we show that their fluorescence can be an intrinsic property of the Au cluster itself. We find a very intense and sharp fluorescence peak located at λ=739.
View Article and Find Full Text PDFActa Crystallogr B Struct Sci Cryst Eng Mater
August 2016
The sixth blind test of organic crystal structure prediction (CSP) methods has been held, with five target systems: a small nearly rigid molecule, a polymorphic former drug candidate, a chloride salt hydrate, a co-crystal and a bulky flexible molecule. This blind test has seen substantial growth in the number of participants, with the broad range of prediction methods giving a unique insight into the state of the art in the field. Significant progress has been seen in treating flexible molecules, usage of hierarchical approaches to ranking structures, the application of density-functional approximations, and the establishment of new workflows and `best practices' for performing CSP calculations.
View Article and Find Full Text PDFFormation of partly dissociated water chains is observed on CaO(001) films upon water exposure at 300 K. While morphology and orientation of the 1D assemblies are revealed from scanning tunneling microscopy, their atomic structure is identified with infrared absorption spectroscopy combined with density functional theory calculations. The latter exploit an ab initio genetic algorithm linked to atomistic thermodynamics to determine low-energy H2O configurations on the oxide surface.
View Article and Find Full Text PDFStatistical learning of materials properties or functions so far starts with a largely silent, nonchallenged step: the choice of the set of descriptive parameters (termed descriptor). However, when the scientific connection between the descriptor and the actuating mechanisms is unclear, the causality of the learned descriptor-property relation is uncertain. Thus, a trustful prediction of new promising materials, identification of anomalies, and scientific advancement are doubtful.
View Article and Find Full Text PDFBy applying a genetic algorithm and ab initio atomistic thermodynamics, we identify the stable and metastable compositions and structures of MgMOx clusters at realistic temperatures and oxygen pressures. We find that small clusters (M≲5) are in thermodynamic equilibrium when x>M. The nonstoichiometric clusters exhibit peculiar magnetic behavior, suggesting the possibility of tuning magnetic properties by changing environmental pressure and temperature conditions.
View Article and Find Full Text PDFWater-metal interfaces are ubiquitous and play a key role in many chemical processes, from catalysis to corrosion. Whereas water adlayers on atomically flat transition metal surfaces have been investigated in depth, little is known about the chemistry of water on stepped surfaces, commonly occurring in realistic situations. Using first-principles simulations, we study the adsorption of water on a stepped platinum surface.
View Article and Find Full Text PDF