Knowledge discovery through chemical space networks: the case of organic electronics.

J Mol Model

Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, 85747, Garching, Germany.

Published: March 2019

AI Article Synopsis

  • Modern materials discovery leverages computational screening of extensive databases, combining experimental and virtual data to compute key microscopic quantities for potential materials.
  • The study presents a Chemical Space Network (CSN) visualization of over 64,000 molecular crystals, highlighting structural similarities and providing insight into organic semiconductor design rules.
  • By mapping clusters of similar molecules, the CSN reveals regions with high potential for optimization, particularly those with promising properties yet unexplored in chemical space.

Article Abstract

Modern materials discovery and design studies often rely on the computational screening of large databases. Complementing experimental databases, virtual databases are increasingly established through the first-principles calculation of microscopic quantities that are computationally inexpensive but decisive for a given application. These so-called descriptors are calculated for vast numbers of candidate materials. In general, the sheer volume of data points generated in such studies precludes an in-depth human analysis. To address this, smart visualization techniques, based, e.g., on so-called chemical space networks (CSNs), have been developed to extract general design rules connecting structural modifications to changes in the target functionality. In this work, we generate and visualize the CSN of possible crystalline organic semiconductors based on an in-house database of >64,000 molecular crystals that we extracted from the exhaustive Cambridge Structural Database and for which we computed prominent charge-mobility descriptors. Our CSN links clusters of molecular crystals based on the chemical similarity of the scaffolds of their molecular building blocks and thus groups communities of similar molecules. By including each cluster's median descriptor values, the CSN visualization not only reproduces known trends of good organic semiconductors but also allows us to extract general design rules for organic molecular scaffolds. Finally, the local environment of each scaffold in our visualization shows how thoroughly its local chemical space has already been explored synthetically. Of special interest here are those clusters with promising descriptor values, yet with few or no connections in the sampled chemical space, as these offer the most room for scaffold optimization.
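
The linking and ranking idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the fingerprint bit sets, the similarity threshold of 0.5, the descriptor values, and the "lower is better" criterion are all assumptions invented for illustration, and Tanimoto similarity is used here only as one common choice of chemical similarity measure. The function names `tanimoto`, `build_csn`, and `underexplored` are hypothetical.

```python
from itertools import combinations
from statistics import median


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto (Jaccard) similarity between two sets of fingerprint bits."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def build_csn(fingerprints: dict, threshold: float) -> dict:
    """Link every pair of clusters whose scaffold similarity reaches the threshold.

    Returns an adjacency map: cluster name -> set of neighbouring cluster names.
    """
    edges = {name: set() for name in fingerprints}
    for u, v in combinations(fingerprints, 2):
        if tanimoto(fingerprints[u], fingerprints[v]) >= threshold:
            edges[u].add(v)
            edges[v].add(u)
    return edges


def underexplored(edges: dict, descriptors: dict, is_promising, max_degree: int = 0) -> list:
    """Clusters with a promising median descriptor and at most max_degree links."""
    return sorted(
        name
        for name, neighbours in edges.items()
        if len(neighbours) <= max_degree and is_promising(median(descriptors[name]))
    )


# Toy data, invented for illustration only (assumption: lower descriptor is better).
fingerprints = {
    "A": frozenset({1, 2, 3, 4}),
    "B": frozenset({1, 2, 3, 5}),
    "C": frozenset({7, 8, 9}),
}
descriptors = {
    "A": [0.30, 0.35, 0.40],
    "B": [0.32, 0.28],
    "C": [0.10, 0.12, 0.15],
}

csn = build_csn(fingerprints, threshold=0.5)
candidates = underexplored(csn, descriptors, is_promising=lambda x: x < 0.2)
```

In this toy example, clusters A and B share enough fingerprint bits to be linked, while C stays isolated and has the most favourable median descriptor, so it is the one flagged as under-explored. A real workflow would derive the fingerprints from molecular scaffolds with a cheminformatics toolkit instead of hand-written bit sets.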

Source
http://dx.doi.org/10.1007/s00894-019-3950-6
