Using the gini coefficient to measure the chemical diversity of small-molecule libraries.

J Comput Chem

VIF Innovations, LLC, Gaithersburg, Maryland, 20877.

Published: August 2016

Modern databases of small organic molecules contain tens of millions of structures. The size of theoretically available chemistry is even larger. However, despite the large amount of chemical information, the "big data" moment for chemistry has not yet provided the corresponding payoff of cheaper computer-predicted medicine or robust machine-learning models for the determination of efficacy and toxicity. Here, we present a study of the diversity of chemical datasets using a measure that is commonly used in socioeconomic studies. We demonstrate the use of this diversity measure on several datasets that were constructed to contain various congeneric subsets of molecules as well as randomly selected molecules. We also apply our method to a number of well-known databases that are frequently used for structure-activity relationship modeling. Our results show the poor diversity of the common sources of potential lead compounds compared to actual known drugs. © 2016 Wiley Periodicals, Inc.

Download full-text PDF

Source
http://dx.doi.org/10.1002/jcc.24423DOI Listing

Publication Analysis

Top Keywords

gini coefficient
4
coefficient measure
4
measure chemical
4
diversity
4
chemical diversity
4
diversity small-molecule
4
small-molecule libraries
4
libraries modern
4
modern databases
4
databases small
4

Similar Publications

Objective: The purpose of this study is to analyse the changes in the equity of intensive care unit (ICU) bed allocation in 14 cities in China's Guangxi Zhuang Autonomous Region from 2018 to 2021, to identify the problems in the process of ICU bed allocation in China's ethnic minority regions.

Design: The Gini coefficient, Theil index, health resource density index, and spatial correlation analysis were used to analyse the current status of ICU bed resource allocation and allocation equity in Guangxi, China, on two dimensions: geography, and population.

Setting: The Guangxi Zhuang Autonomous Region.

View Article and Find Full Text PDF

Background: Drug use disorder (DUD) poses a major public health crisis globally, necessitating immediate attention to global trends and future projections to develop effective health policies and interventions. Thus, we aimed to estimate the global trends in DUD mortality rates from 1990 to 2021 and future projections of DUD deaths until 2040 across 73 countries.

Methods: In this time-series analysis and modelling study, we investigated the global trends in DUD mortality rates from 1990 to 2021 using the WHO Mortality Database and forecasted future trends through 2040.

View Article and Find Full Text PDF

Background: In the context of public health emergencies, the presence of medical and health talents (MHT) is critically important for support in any country or region. This study aims to analyze the spatial and temporal distributions and evolution of MHT in China and propose strategies and recommendations for promoting a balanced distribution.

Methods: This research used data from 31 provinces in China to construct a multidimensional index system for measuring the agglomeration level of MHT.

View Article and Find Full Text PDF

Modeling regional private income and its embedded carbon emissions: sources, flows and inequalities.

Sci Total Environ

January 2025

Key Laboratory of Groundwater Resources and Environment, Ministry of Education, Jilin University, Changchun 130021, China; Jilin Provincial Key Laboratory of Water Resources and Environment, Jilin University, Changchun 130021, China; College of New Energy and Environment, Jilin University, Changchun 130012, China; School of Earth and Environmental Sciences, Cardiff University, Cardiff CF10 3AT, UK. Electronic address:

Composing regional total income jointly with government income, private income represents levels of development and affluence from the household perspective. Considering the need for fair carbon emission reduction responsibility distributions among regions with divergent income levels, private income-embedded emission (PIEE) and the inter-regional inequalities remain to be explored. Combining input-output analysis and the Gini coefficient, this study traces the sources and disposals of regional private income in China, as well as their embedded carbon emission flow, and quantifies the distribution and inequality of PIEE across industrial sectors and provincial regions.

View Article and Find Full Text PDF

Quantitative Indicators of Microstructure and Texture Heterogeneity in Polycrystalline System.

Materials (Basel)

December 2024

Eötvös Lorand University (ELTE), Faculty of Informatics, Savaria Institute of Technology, Karolyi Gaspar tér 4, 9700 Szombathely, Hungary.

The microstructural features of polycrystals determine numerous properties, whereas the evolution of crystallographic texture is responsible for the anisotropy of particular properties. Therefore, it is of crucial importance to find proper quantitative indicators, which reflect the nature of microstructure and texture characteristics. This is partially performed by the assessment of the average grain size and texture intensity that provide basic information on the microstructural features evolved; however, often, the basic quantitative indicators are not capable of revealing the complete microstructural state especially when the system is highly heterogeneous.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!