Growing debates about algorithmic bias in public health surveillance lack specific examples. We tested a common assumption that exposure and illness periods coincide and demonstrated how algorithmic bias can arise due to missingness of critical information related to illness and exposure durations. We examined 9407 outbreaks recorded by the United States National Outbreak Reporting System (NORS) from January 1, 2009 through December 31, 2019 and detected algorithmic bias, a systematic over- or under-estimation of foodborne disease outbreak (FBDO) durations due to missing start and end dates. For 7037 (75%) FBDOs with complete date-time information, ~ 60% reported that the exposure period ended before the illness period started. For 2079 (87.7%) FBDOs with missing exposure dates, average illness durations were ~ 5.3 times longer (p < 0.001) than those with complete information, prompting the potential for algorithmic bias. Modern surveillance systems must be equipped with investigative capacities to examine and assess structural data missingness that can lead to bias.

Download full-text PDF

Source
http://dx.doi.org/10.1057/s41271-024-00477-2DOI Listing

Publication Analysis

Top Keywords

algorithmic bias
16
united states
8
states national
8
national outbreak
8
outbreak reporting
8
reporting system
8
missingness algorithmic
4
bias
4
bias example
4
example united
4

Similar Publications

Generative Artificial Intelligence (AI), characterized by its ability to generate diverse forms of content including text, images, video and audio, has revolutionized many fields, including medical education. Generative AI leverages machine learning to create diverse content, enabling personalized learning, enhancing resource accessibility, and facilitating interactive case studies. This narrative review explores the integration of generative artificial intelligence (AI) into orthopedic education and training, highlighting its potential, current challenges, and future trajectory.

View Article and Find Full Text PDF

Background: Medical narratives are fundamental to the correct identification of a patient's health condition. This is not only because it describes the patient's situation. It also contains relevant information about the patient's context and health state evolution.

View Article and Find Full Text PDF

Bias in machine learning applications to address non-communicable diseases at a population-level: a scoping review.

BMC Public Health

December 2024

Upstream Lab, MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, Unity Health Toronto, 30 Bond Street, Toronto, ON, M5B 1W8, Canada.

Background: Machine learning (ML) is increasingly used in population and public health to support epidemiological studies, surveillance, and evaluation. Our objective was to conduct a scoping review to identify studies that use ML in population health, with a focus on its use in non-communicable diseases (NCDs). We also examine potential algorithmic biases in model design, training, and implementation, as well as efforts to mitigate these biases.

View Article and Find Full Text PDF

Health care decisions are increasingly informed by clinical decision support algorithms, but these algorithms may perpetuate or increase racial and ethnic disparities in access to and quality of health care. Further complicating the problem, clinical data often have missing or poor quality racial and ethnic information, which can lead to misleading assessments of algorithmic bias. We present novel statistical methods that allow for the use of probabilities of racial/ethnic group membership in assessments of algorithm performance and quantify the statistical bias that results from error in these imputed group probabilities.

View Article and Find Full Text PDF

This paper presents a ground motion prediction (GMP) model using an artificial neural network (ANN) for shallow earthquakes, aimed at improving earthquake hazard safety evaluation. The proposed model leverages essential input variables such as moment magnitude, fault type, epicentral distance, and soil type, with the output variable being peak ground acceleration (PGA) at 5% damping. To develop this model, 885 data pairs were obtained from the Pacific Engineering Research Center, providing a robust dataset for training and validation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!