Publications by Shawn N Murphy | LitMetric

Publications by authors named "Shawn N Murphy"


Estimation of prevalence of autoimmune diseases in the United States using electronic health record data.

Aaron H Abend, Ingrid He, Neil Bahroos, Stratos Christianakis, Ashley B Crew, Shawn N Murphy

J Clin Invest

December 2024

Background: Previous epidemiologic studies of autoimmune diseases in the United States (US) have included a limited number of diseases or used meta-analyses that rely on different data collection methods and analyses for each disease.

Methods: To estimate the prevalence of autoimmune diseases in the US, we used electronic health record data from six large medical systems in the US. We developed a software program using common methodology to compute the estimated prevalence of autoimmune diseases alone and in aggregate that can be readily used by other investigators to replicate or modify the analysis over time.


Precision phenotyping for curating research cohorts of patients with unexplained post-acute sequelae of COVID-19.

Alaleh Azhir, Jonas Hügel, Jiazi Tian, Jingya Cheng, Ingrid V Bassett, Shawn N Murphy

Med

November 2024

Background: Scalable identification of patients with post-acute sequelae of COVID-19 (PASC) is challenging due to a lack of reproducible precision phenotyping algorithms, which has led to suboptimal accuracy, demographic biases, and underestimation of PASC.

Methods: In a retrospective case-control study, we developed a precision phenotyping algorithm for identifying cohorts of patients with PASC. We used longitudinal electronic health records data from over 295,000 patients from 14 hospitals and 20 community health centers in Massachusetts.


IgG testing, immunoglobulin replacement therapy, and infection outcomes in patients with CLL or NHL: real-world evidence.

Jacob D Soumerai, Zaid Yousif, Thais Gift, Raj Desai, Lynn Huynh, Shawn N Murphy

Blood Adv

August 2024

Patients with chronic lymphocytic leukemia (CLL) and non-Hodgkin lymphoma (NHL) can develop hypogammaglobulinemia, a form of secondary immune deficiency (SID), from the disease and treatments. Patients with hypogammaglobulinemia with recurrent infections may benefit from immunoglobulin replacement therapy (IgRT). This study evaluated patterns of immunoglobulin G (IgG) testing and the effectiveness of IgRT in real-world patients with CLL or NHL.


Family history as the strongest predictor of aortic and peripheral aneurysms in patients with intracranial aneurysms.

Pui Man Rosalind Lai, Elliot Akama-Garren, Anil Can, Selena-Rae Tirado, Victor M Castro, Shawn N Murphy

J Clin Neurosci

August 2024

Objective: Intracranial aneurysms (IA) and aortic aneurysms (AA) are both abnormal dilations of arteries with familial predisposition and have been proposed to share co-prevalence and pathophysiology. Associations of IA and non-aortic peripheral aneurysms are less well-studied. The goal of the study was to understand the patterns of aortic and peripheral (extracranial) aneurysms in patients with IA, and risk factors associated with the development of these aneurysms.


Precision Phenotyping for Curating Research Cohorts of Patients with Post-Acute Sequelae of COVID-19 (PASC) as a Diagnosis of Exclusion.

Alaleh Azhir, Jonas Hügel, Jiazi Tian, Jingya Cheng, Ingrid V Bassett, Shawn N Murphy

medRxiv

April 2024

Scalable identification of patients with the post-acute sequelae of COVID-19 (PASC) is challenging due to a lack of reproducible precision phenotyping algorithms and the suboptimal accuracy, demographic biases, and underestimation of the PASC diagnosis code (ICD-10 U09.9). In a retrospective case-control study, we developed a precision phenotyping algorithm for identifying research cohorts of PASC patients, defined as a diagnosis of exclusion.


Reply to Li et al.

Dinah Foer, Zachary H Strasser, Jing Cui, Katherine N Cahill, Joshua A Boyce, Shawn N Murphy

Am J Respir Crit Care Med

December 2023


Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study.

Arianna Dagliati, Zachary H Strasser, Zahra Shakeri Hossein Abad, Jeffrey G Klann, Kavishwar B Wagholikar, Shawn N Murphy

EClinicalMedicine

October 2023

Background: Characterizing Post-Acute Sequelae of COVID (SARS-CoV-2 Infection), or PASC, has been challenging due to the multitude of sub-phenotypes, temporal attributes, and definitions. Scalable characterization of PASC sub-phenotypes can enhance screening capacities, disease management, and treatment planning.

Methods: We conducted a retrospective multi-centre observational cohort study, leveraging longitudinal electronic health record (EHR) data of 30,422 patients from three healthcare systems in the Consortium for the Clinical Characterization of COVID-19 by EHR (4CE).


Association of GLP-1 Receptor Agonists with Chronic Obstructive Pulmonary Disease Exacerbations among Patients with Type 2 Diabetes.

Dinah Foer, Zachary H Strasser, Jing Cui, Katherine N Cahill, Joshua A Boyce, Shawn N Murphy

Am J Respir Crit Care Med

November 2023

Rationale: Patients with chronic obstructive pulmonary disease (COPD) and type 2 diabetes (T2D) have worse clinical outcomes compared with patients without metabolic dysregulation. GLP-1 (glucagon-like peptide 1) receptor agonists (GLP-1RAs) reduce asthma exacerbation risk and improve FVC in patients with COPD.

Objectives: To determine whether GLP-1RA use is associated with reduced COPD exacerbation rates, and severe and moderate exacerbation risk, compared with other T2D therapies.


A broadly applicable approach to enrich electronic-health-record cohorts by identifying patients with complete data: a multisite evaluation.

Jeffrey G Klann, Darren W Henderson, Michele Morris, Hossein Estiri, Griffin M Weber, Shawn N Murphy

J Am Med Inform Assoc

November 2023

Objective: Patients who receive most care within a single healthcare system (colloquially called a "loyalty cohort" since they typically return to the same providers) have mostly complete data within that organization's electronic health record (EHR). Loyalty cohorts have low data missingness, which can unintentionally bias research results. Using proxies of routine care and healthcare utilization metrics, we compute a per-patient score that identifies a loyalty cohort.


A retrospective cohort analysis leveraging augmented intelligence to characterize long COVID in the electronic health record: A precision medicine framework.

Zachary H Strasser, Arianna Dagliati, Zahra Shakeri Hossein Abad, Jeffrey G Klann, Kavishwar B Wagholikar, Shawn N Murphy

PLOS Digit Health

July 2023

Physical and psychological symptoms lasting months following an acute COVID-19 infection are now recognized as post-acute sequelae of COVID-19 (PASC). Accurate tools for identifying such patients could enhance screening capabilities for the recruitment for clinical trials, improve the reliability of disease estimates, and allow for more accurate downstream cohort analysis. In this retrospective cohort study, we analyzed the EHR of hospitalized COVID-19 patients across three healthcare systems to develop a pipeline for better identifying patients with persistent PASC symptoms (dyspnea, fatigue, or joint pain) after their SARS-CoV-2 infection.


Development of a Definition of Postacute Sequelae of SARS-CoV-2 Infection.

Tanayott Thaweethai, Sarah E Jolley, Elizabeth W Karlson, Emily B Levitan, Bruce Levy, Shawn N Murphy

JAMA

June 2023

Importance: SARS-CoV-2 infection is associated with persistent, relapsing, or new symptoms or other health effects occurring after acute infection, termed postacute sequelae of SARS-CoV-2 infection (PASC), also known as long COVID. Characterizing PASC requires analysis of prospectively and uniformly collected data from diverse uninfected and infected individuals.

Objective: To develop a definition of PASC using self-reported symptoms and describe PASC frequencies across cohorts, vaccination status, and number of infections.


Temporal characterization of Alzheimer's Disease with sequences of clinical records.

Hossein Estiri, Alaleh Azhir, Deborah L Blacker, Christine S Ritchie, Chirag J Patel, Shawn N Murphy

EBioMedicine

June 2023

Background: Alzheimer's Disease (AD) is a complex clinical phenotype with unprecedented social and economic tolls on an ageing global population. Real-world data (RWD) from electronic health records (EHRs) offer opportunities to accelerate precision drug development and scale epidemiological research on AD. A precise characterization of AD cohorts is needed to address the noise abundant in RWD.


Severity of COVID-19-Related Illness in Massachusetts, July 2021 to December 2022.

Alaleh Azhir, Zachary H Strasser, Shawn N Murphy, Hossein Estiri

JAMA Netw Open

April 2023


Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?

Amelia L M Tan, Emily J Getzen, Meghan R Hutch, Zachary H Strasser, Alba Gutiérrez-Sacristán, Shawn N Murphy

J Biomed Inform

March 2023

Background: In electronic health records, patterns of missing laboratory test results could capture patients' course of disease as well as reflect clinicians' concerns about possible conditions. These patterns are often understudied and overlooked. This study aims to identify informative patterns of missingness among laboratory data collected across 15 healthcare system sites in three countries for COVID-19 inpatients.


Returning integrated genomic risk and clinical recommendations: The eMERGE study.

Jodell E Linder, Aimee Allworth, Harris T Bland, Pedro J Caraballo, Rex L Chisholm, Shawn N Murphy

Genet Med

April 2023

Article Synopsis

The study aims to assess the risk of common diseases by considering clinical, monogenic, and polygenic factors, which may be reflected in an individual's family history.
The eMERGE network is enrolling 25,000 individuals in a prospective study to create and return a comprehensive risk assessment report (GIRA) that includes various genetic risk factors and care recommendations.
The GIRA report provides actionable guidelines for health care based on genetic data, highlighting the importance of integrating genetic risk assessment into routine health care practices.


Quantifying the phenome-wide disease burden of obesity using electronic health records and genomics.

Jamie R Robinson, Robert J Carroll, Lisa Bastarache, Qingxia Chen, James Pirruccello, Shawn N Murphy

Obesity (Silver Spring)

December 2022

Objective: High BMI is associated with many comorbidities and mortality. This study aimed to elucidate the overall clinical risk of obesity using a genome- and phenome-wide approach.

Methods: This study performed a phenome-wide association study of BMI using a clinical cohort of 736,726 adults.


Estimates of SARS-CoV-2 Omicron BA.2 Subvariant Severity in New England.

Zachary H Strasser, Noah Greifer, Aboozar Hadavand, Shawn N Murphy, Hossein Estiri

JAMA Netw Open

October 2022

Importance: The SARS-CoV-2 Omicron subvariant, BA.2, may be less severe than previous variants; however, confounding factors make interpreting the intrinsic severity challenging.

Objective: To compare the adjusted risks of mortality, hospitalization, intensive care unit admission, and invasive ventilation between the BA.


I2b2-etl: Python application for importing electronic health data into the informatics for integrating biology and the bedside platform.

Kavishwar B Wagholikar, Layne Ainsworth, David Zelle, Kira Chaney, Michael Mendis, Shawn N Murphy

Bioinformatics

October 2022

Motivation: The i2b2 platform is used at major academic health institutions and research consortia for querying electronic health data. However, a major obstacle to wider utilization of the platform is the complexity of data loading, which entails a steep learning curve for the platform's complex data schemas. To address this problem, we have developed the i2b2-etl package, which simplifies the data loading process and will facilitate wider deployment and utilization of the platform.


SurvMaximin: Robust federated approach to transporting survival risk prediction models.

Xuan Wang, Harrison G Zhang, Xin Xiong, Chuan Hong, Griffin M Weber, Shawn N Murphy

J Biomed Inform

October 2022

Objective: For multi-center heterogeneous Real-World Data (RWD) with time-to-event outcomes and high-dimensional features, we propose the SurvMaximin algorithm to estimate Cox model feature coefficients for a target population by borrowing summary information from a set of health care centers without sharing patient-level information.

Materials and Methods: For each of the centers from which we want to borrow information to improve the prediction performance for the target population, a penalized Cox model is fitted to estimate feature coefficients for the center. Using estimated feature coefficients and the covariance matrix of the target population, we then obtain a SurvMaximin estimated set of feature coefficients for the target population.


Analytics to monitor local impact of the Protecting Access to Medicare Act's imaging clinical decision support requirements.

Vladimir I Valtchinov, Shawn N Murphy, Ronilda Lacson, Nikolay Ikonomov, Bingxue K Zhai

J Am Med Inform Assoc

October 2022

Objective: This study aimed to: (1) extend the Integrating the Biology and the Bedside (i2b2) data and application models to include medical imaging appropriate use criteria, enabling it to serve as a platform to monitor local impact of the Protecting Access to Medicare Act's (PAMA) imaging clinical decision support (CDS) requirements, and (2) validate the i2b2 extension using data from the Medicare Imaging Demonstration (MID) CDS implementation.

Materials and Methods: This study provided a reference implementation and assessed its validity and reliability using data from the MID, the federal government's predecessor to PAMA's imaging CDS program. The Star Schema was extended to describe the interactions of imaging ordering providers with the CDS.


Natural Language Processing to Improve Prediction of Incident Atrial Fibrillation Using Electronic Health Records.

Jeffrey M Ashburner, Yuchiao Chang, Xin Wang, Shaan Khurshid, Christopher D Anderson, Shawn N Murphy

J Am Heart Assoc

August 2022

Background: Models predicting atrial fibrillation (AF) risk, such as Cohorts for Heart and Aging Research in Genomic Epidemiology AF (CHARGE-AF), have not performed as well in electronic health records. Natural language processing (NLP) may improve models by using narrative electronic health record text.

Methods and Results: From a primary care network, we included patients aged ≥65 years with visits between 2003 and 2013 in development (n=32,960) and internal validation cohorts (n=13,992).


Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis.

Kavishwar B Wagholikar, David Zelle, Layne Ainsworth, Kira Chaney, Alexander J Blood, Shawn N Murphy

Inform Med Unlocked

June 2022

Analysis of health data typically requires a data analyst to develop queries in structured query language (SQL). Because the SQL queries are created manually, they are prone to errors. In addition, accurate implementation of the queries depends on effective communication with clinical experts, which makes the analysis even more error-prone.


Multiview Incomplete Knowledge Graph Integration with application to cross-institutional EHR data harmonization.

Doudou Zhou, Ziming Gan, Xu Shi, Alina Patwari, Everett Rush, Shawn N Murphy

J Biomed Inform

September 2022

Objective: The growing availability of electronic health records (EHR) data opens opportunities for integrative analysis of multi-institutional EHR to produce generalizable knowledge. A key barrier to such integrative analyses is the lack of semantic interoperability across different institutions due to coding differences. We propose a Multiview Incomplete Knowledge Graph Integration (MIKGI) algorithm to integrate information from multiple sources with partially overlapping EHR concept codes to enable translations between healthcare systems.


International electronic health record-derived post-acute sequelae profiles of COVID-19 patients.

Harrison G Zhang, Arianna Dagliati, Zahra Shakeri Hossein Abad, Xin Xiong, Clara-Lea Bonzel, Shawn N Murphy

NPJ Digit Med

June 2022

The risk profiles of post-acute sequelae of COVID-19 (PASC) have not been well characterized in multi-national settings with appropriate controls. We leveraged electronic health record (EHR) data from 277 international hospitals representing 414,602 patients with COVID-19, 2.3 million control patients without COVID-19 in the inpatient and outpatient settings, and over 221 million diagnosis codes to systematically identify new-onset conditions enriched among patients with COVID-19 during the post-acute period.


International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality.

Griffin M Weber, Chuan Hong, Zongqi Xia, Nathan P Palmer, Paul Avillach, Shawn N Murphy

NPJ Digit Med

June 2022

Given the growing number of prediction algorithms developed to predict COVID-19 mortality, we evaluated the transportability of a mortality prediction algorithm using a multi-national network of healthcare systems. We predicted COVID-19 mortality using baseline commonly measured laboratory values and standard demographic and clinical covariates across healthcare systems, countries, and continents. Specifically, we trained a Cox regression model with nine measured laboratory test values, standard demographics at admission, and comorbidity burden pre-admission.

