The implementation of machine learning models has brought major changes in the decision-making process for materials design. One matter of concern for the data-driven approaches is the lack of negative data from unsuccessful synthetic attempts, which might generate inherently imbalanced datasets. We propose the application of the one-class classification methodology as an effective tool for tackling these limitations on the materials design problems. This is a concept of learning based only on a well-defined class without counter examples. An extensive study on the different one-class classification algorithms is performed until the most appropriate workflow is identified for guiding the discovery of emerging materials belonging to a relatively small class, that being the weakly bound polyaromatic hydrocarbon co-crystals. The two-step approach presented in this study first trains the model using all the known molecular combinations that form this class of co-crystals extracted from the Cambridge Structural Database (1722 molecular combinations), followed by scoring possible yet unknown pairs from the ZINC15 database (21 736 possible molecular combinations). Focusing on the highest-ranking pairs predicted to have higher probability of forming co-crystals, materials discovery can be accelerated by reducing the vast molecular space and directing the synthetic efforts of chemists. Further on, using interpretability techniques a more detailed understanding of the molecular properties causing co-crystallization is sought after. The applicability of the current methodology is demonstrated with the discovery of two novel co-crystals, namely pyrene-6-benzo[]chromen-6-one () and pyrene-9,10-dicyanoanthracene ().

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179233PMC
http://dx.doi.org/10.1039/d0sc04263cDOI Listing

Publication Analysis

Top Keywords

molecular combinations
12
materials design
8
one-class classification
8
molecular
5
class
4
class classification
4
classification practical
4
practical approach
4
approach accelerating
4
accelerating π-π
4

Similar Publications

Biophysical constraints limit the specificity with which transcription factors (TFs) can target regulatory DNA. While individual nontarget binding events may be low affinity, the sheer number of such interactions could present a challenge for gene regulation by degrading its precision or possibly leading to an erroneous induction state. Chromatin can prevent nontarget binding by rendering DNA physically inaccessible to TFs, at the cost of energy-consuming remodeling orchestrated by pioneer factors (PFs).

View Article and Find Full Text PDF

Dissolution of CO in water followed by the subsequent hydrolysis reactions is of great importance to the global carbon cycle, and carbon capture and storage. Despite numerous previous studies, the reactions are still not fully understood at the atomistic scale. Here, we combined ab initio molecular dynamics (AIMD) simulations with Markov state models to elucidate the reaction mechanisms and kinetics of CO in supercritical water both in the bulk and nanoconfined states.

View Article and Find Full Text PDF

In species with genetic sex determination (GSD), the sex identity of the soma determines germ cell fate. For example, in mice, XY germ cells that enter an ovary differentiate as oogonia, whereas XX germ cells that enter a testis initiate differentiation as spermatogonia. However, numerous species lack a GSD system and instead display temperature-dependent sex determination (TSD).

View Article and Find Full Text PDF

Matrigel/BME, a basement membrane-like preparation, supports long-term growth of epithelial 3D organoids from adult stem cells [T. Sato , , 262-265 (2009); T. Sato , , 1762-1772 (2011)].

View Article and Find Full Text PDF

Preeclampsia is characterized by insufficient invasion of extravillous trophoblasts and is a consequence of failed adaption of extravillous trophoblasts to changes in the intrauterine environment developing embryo. Specific miRNAs are implicated in the development of preeclampsia (PE). miR-455-5p is present at low levels in PE but its role is not known.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!