Use of Benford's law in drug discovery data.

Drug Discov Today

Chemistry Research Labs, Drug Discovery Research, Astellas Pharma Inc., 21 Miyukigaoka, Tsukuba, Ibaraki 305-8585, Japan.

Published: May 2010

Benford's law states that the distribution of the first digit of many data sets is not uniform. The first digit of any random number will be 1 almost 30% of the time, and larger digits occur as the first digit with lower and lower frequency, to the point where 9 occurs as a first digit only 5% of the time. Here, we demonstrate that several data sets in the field of drug discovery follow Benford's distribution, whereas 'doctored' data do not. Our findings indicate the applicability of Benford's law in assessing data quality in the field of drug discovery. We also propose a useful index of evaluating data quality based on Benford's law.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.drudis.2010.03.003DOI Listing

Publication Analysis

Top Keywords

benford's law
16
drug discovery
12
data sets
8
field drug
8
data quality
8
data
6
benford's
5
law drug
4
discovery data
4
data benford's
4

Similar Publications

Assessing the scientific integrity of the collected work of one author or author-group.

J Clin Epidemiol

January 2025

Department of Obstetrics and Gynaecology, Monash University, Clayton, Australia; Aberdeen Centre for Women's Health Research, School of Medicine, University of Aberdeen, Aberdeen, UK. Electronic address:

Objective: No published methods for research integrity review include both statistical techniques applied to groups of randomised trials and individual assessment of papers. We propose a method based on practical experience of investigating data integrity across the collected papers of one author or author-group.

Study Design And Setting: We report our approach to investigating the collected papers of an author or author-group suspected of academic misconduct.

View Article and Find Full Text PDF

Are managers of emerging markets more opportunistic? application of Benford's Law.

PLoS One

December 2024

Faculty of Management, Department of Economics and Finance, Comenius University Bratislava, Bratislava, Slovakia.

Purpose: The current research analyzes cosmetic earnings management practices in emerging and developed markets before and after the global financial crisis.

Design/methodology/approach: Using digital analysis, by applying Benford's Law the study analyzes the earnings adjustments that exceed a key reference point to determine whether earnings management anomaly exists or not? Based on a sample of 87165 firm-year observations of UK, US, Brazil, Russia, India, China and Pakistan listed corporations.

Findings: Findings show that the managers of emerging markets have more incentive to manipulate earnings than their counterparts from developed markets.

View Article and Find Full Text PDF

The inefficient effects of non-clinical factors on health care costs.

Health Econ Policy Law

September 2024

Department of Finance, Insurance and Real Estate, University of Memphis, Memphis, TN, USA.

We use Benford's law to examine the non-random elements of health care costs. We find that as health care expenditures increase, the conformity to the expected distribution of naturally occurring numbers worsens, indicating a tendency towards inefficient treatment. Government insurers follow Benford's law better than private insurers indicating more efficient treatment.

View Article and Find Full Text PDF

This paper utilizes Benford's law, the distribution that the first significant digit of numbers in certain datasets should follow, to assess the accuracy of self-reported health expenditure data known for measurement errors. We provide both simulation and real data evidence supporting the validity assumption that genuine health expenditure data conform to Benford's law. We then conduct a Benford analysis of health expenditure variables from two widely utilized public datasets, the China Health and Nutrition Survey and the China Family Panel Studies.

View Article and Find Full Text PDF
Article Synopsis
  • Some COVID-19 testing centers have been found to manipulate test numbers, leading to a study that compared traditional fraud detection methods with statistical approaches to identify potential fraud.
  • The study used various statistical techniques, including outlier detection and regression modeling, analyzing billing data from 907 testing centers in a German city between April 2021 and August 2022.
  • Results showed that statistical methods were more effective in identifying suspicious centers, with significantly higher specificity compared to traditional methods, suggesting the need for better data tracking in future health crises.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!