Without careful dissection of the ways in which biases can be encoded into artificial intelligence (AI) health technologies, there is a risk of perpetuating existing health inequalities at scale. One major source of bias is the data that underpins such technologies. The STANDING Together recommendations aim to encourage transparency regarding limitations of health datasets and proactive evaluation of their effect across population groups.
View Article and Find Full Text PDFLarge language models (LLMs) hold promise to serve complex health information needs but also have the potential to introduce harm and exacerbate health disparities. Reliably evaluating equity-related model failures is a critical step toward developing systems that promote health equity. We present resources and methodologies for surfacing biases with potential to precipitate equity-related harms in long-form, LLM-generated answers to medical questions and conduct a large-scale empirical case study with the Med-PaLM 2 LLM.
View Article and Find Full Text PDFAnesthesiol Clin
September 2024
[This corrects the article DOI: 10.2196/31862.].
View Article and Find Full Text PDFArtificial intelligence as a medical device is increasingly being applied to healthcare for diagnosis, risk stratification and resource allocation. However, a growing body of evidence has highlighted the risk of algorithmic bias, which may perpetuate existing health inequity. This problem arises in part because of systemic inequalities in dataset curation, unequal opportunity to participate in research and inequalities of access.
View Article and Find Full Text PDFAifred is a clinical decision support system (CDSS) that uses artificial intelligence to assist physicians in selecting treatments for major depressive disorder (MDD) by providing probabilities of remission for different treatment options based on patient characteristics. We evaluated the utility of the CDSS as perceived by physicians participating in simulated clinical interactions. Twenty physicians who were either staff or residents in psychiatry or family medicine completed a study in which they had three 10-minute clinical interactions with standardized patients portraying mild, moderate, and severe episodes of MDD.
View Article and Find Full Text PDFBackground: Approximately two-thirds of patients with major depressive disorder do not achieve remission during their first treatment. There has been increasing interest in the use of digital, artificial intelligence-powered clinical decision support systems (CDSSs) to assist physicians in their treatment selection and management, improving the personalization and use of best practices such as measurement-based care. Previous literature shows that for digital mental health tools to be successful, the tool must be easy for patients and physicians to use and feasible within existing clinical workflows.
View Article and Find Full Text PDFMuch categorization behavior can be explained by family resemblance: New items are classified by comparison with previously learned exemplars. However, categorization behavior also shows a variety of dimensional biases, where the underlying space has so-called "separable" dimensions: Ease of learning categories depends on how the stimuli align with the separable dimensions of the space. For example, if a set of objects of various sizes and colors can be accurately categorized using a single separable dimension (e.
View Article and Find Full Text PDFBackground: Recently, artificial intelligence-powered devices have been put forward as potentially powerful tools for the improvement of mental healthcare. An important question is how these devices impact the physician-patient interaction.
Aims: Aifred is an artificial intelligence-powered clinical decision support system (CDSS) for the treatment of major depression.
Objective: Determine if deep learning detects sepsis earlier and more accurately than other models. To evaluate model performance using implementation-oriented metrics that simulate clinical practice.
Materials And Methods: We trained internally and temporally validated a deep learning model (multi-output Gaussian process and recurrent neural network [MGP-RNN]) to detect sepsis using encounters from adult hospitalized patients at a large tertiary academic center.
Background: Successful integrations of machine learning into routine clinical care are exceedingly rare, and barriers to its adoption are poorly characterized in the literature.
Objective: This study aims to report a quality improvement effort to integrate a deep learning sepsis detection and management platform, Sepsis Watch, into routine clinical care.
Methods: In 2016, a multidisciplinary team consisting of statisticians, data scientists, data engineers, and clinicians was assembled by the leadership of an academic health system to radically improve the detection and treatment of sepsis.
Since the first recognition of a cluster of novel respiratory viral infections in China in late December 2019, intensivists in the United States have watched with growing concern as infections with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus-now named coronavirus disease of 2019 (COVID-19)-have spread to hospitals in the United States. Because COVID-19 is extremely transmissible and can progress to a severe form of respiratory failure, the potential to overwhelm available critical care resources is high and critical care management of COVID-19 patients has been thrust into the spotlight. COVID-19 arrived in the United States in January and, as anticipated, has dramatically increased the usage of critical care resources.
View Article and Find Full Text PDFFor many biological applications, exploration of the massive parametric space of a mechanism-based model can impose a prohibitive computational demand. To overcome this limitation, we present a framework to improve computational efficiency by orders of magnitude. The key concept is to train a neural network using a limited number of simulations generated by a mechanistic model.
View Article and Find Full Text PDFAn amendment to this paper has been published and can be accessed via a link at the top of the paper.
View Article and Find Full Text PDFStudy Design: Retrospective study.
Objectives: To establish the inter-rater reliability in the quantitative evaluation of spinal cord damage following cervical incomplete spinal cord injury (SCI) utilizing magnetic resonance imaging (MRI). MRI was used to perform manual measurements of the cranial and caudal boundaries of edema, edema length, midsagittal tissue bridge ratio, axial damage ratio, and edema volume in 10 participants with cervical incomplete SCI.
Background: Pythia is an automated, clinically curated surgical data pipeline and repository housing all surgical patient electronic health record (EHR) data from a large, quaternary, multisite health institute for data science initiatives. In an effort to better identify high-risk surgical patients from complex data, a machine learning project trained on Pythia was built to predict postoperative complication risk.
Methods And Findings: A curated data repository of surgical outcomes was created using automated SQL and R code that extracted and processed patient clinical and surgical data across 37 million clinical encounters from the EHRs.
: Following spinal cord injury (SCI), early prediction of future walking ability is difficult, due to factors such as spinal shock, sedation, impending surgery, and secondary long bone fracture. Accurate, objective biomarkers used in the acute stage of SCI would inform individualized patient management and enhance both patient/family expectations and treatment outcomes. Using magnetic resonance imaging (MRI) and specifically a midsagittal T2-weighted image, the amount of tissue bridging (measured as spared spinal cord tissue) shows potential to serve as such a biomarker.
View Article and Find Full Text PDFBackground: Although interest in ketamine use during electroconvulsive therapy (ECT) has increased, studies have been equivocal with regard to its efficacy. The aims of this clinical trial were to evaluate ketamine's antidepressive effects in ECT as a primary anesthetic, determine ketamine's tolerability when compared with standard anesthesia, and determine if plasma brain-derived neurotrophic factor (BDNF) is necessary for treatment response.
Materials And Methods: Adults meeting criteria for treatment-resistant depression undergoing index course ECT received either methohexital (1 to 2 mg/kg) or ketamine (1 to 2 mg/kg) anesthesia in this dual-arm double-blinded randomized clinical trial (NCT02752724).
With ever-increasing advancements in technology, neuroscientists are able to collect data in greater volumes and with finer resolution. The bottleneck in understanding how the brain works is consequently shifting away from the amount and type of data we can collect and toward what we actually do with the data. There has been a growing interest in leveraging this vast volume of data across levels of analysis, measurement techniques, and experimental paradigms to gain more insight into brain function.
View Article and Find Full Text PDFInvestigating the biological bases of social phenotypes is challenging because social behavior is both high-dimensional and richly structured, and biological factors are more likely to influence complex patterns of behavior rather than any single behavior in isolation. The space of all possible patterns of interactions among behaviors is too large to investigate using conventional statistical methods. In order to quantitatively define social phenotypes from natural behavior, we developed a machine learning model to identify and measure patterns of behavior in naturalistic observational data, as well as their relationships to biological, environmental, and demographic sources of variation.
View Article and Find Full Text PDFThe anterior cingulate and lateral prefrontal cortices have been implicated in implementing context-appropriate attentional control, but the learning mechanisms underlying our ability to flexibly adapt the control settings to changing environments remain poorly understood. Here we show that human adjustments to varying control demands are captured by a reinforcement learner with a flexible, volatility-driven learning rate. Using model-based functional magnetic resonance imaging, we demonstrate that volatility of control demand is estimated by the anterior insula, which in turn optimizes the prediction of forthcoming demand in the caudate nucleus.
View Article and Find Full Text PDFWe examined the relationship between cerebral amyloid angiopathy (CAA), Alzheimer's disease neuropathologic changes, other vascular brain pathologies, and cognition in a large multicenter autopsy sample. Data were obtained from the National Alzheimer's Coordinating Center on autopsied subjects (N = 3976) who died between 2002 and 2012. Descriptive statistics and multivariable regression models estimated the associations between CAA and other pathologies, and between CAA severity and cognitive test scores proximal to death.
View Article and Find Full Text PDF