Publications by authors named "Daniel R Harris"

Objectives: Structured data fields, including medication fields involving naloxone, are routinely used to identify opioid overdoses in emergency medical services (EMS) data; between January 2021 and March 2024, there were approximately 1.2 million instances of naloxone administration. in the United States.

View Article and Find Full Text PDF

Housing is an environmental social determinant of health that is linked to mortality and clinical outcomes. We developed a lexicon of housing-related concepts and rule-based natural language processing methods for identifying these housing-related concepts within clinical text. We piloted our methods on several test cohorts: a synthetic cohort generated by ChatGPT for initial infrastructure testing, a cohort with substance use disorders (SUD), and a cohort diagnosed with problems related to housing and economic circumstances (HEC).

View Article and Find Full Text PDF
Article Synopsis
  • In 2022, 10.2% of fatal overdoses were linked to individuals facing homelessness or housing instability, with a growing concern over stimulant involvement in these overdoses in the U.S.
  • Researchers in Kentucky analyzed stimulant-involved fatal overdoses from 2017 to 2021, linking records from a drug overdose surveillance system to electronic health records to assess housing status.
  • Of the 313 overdose cases studied, initially only 10.5% were identified with housing instability, but after further data analysis, this figure rose to 41.5%, indicating that better data connections could enhance overdose surveillance and public health responses.
View Article and Find Full Text PDF

Background: A wealth of clinically relevant information is only obtainable within unstructured clinical narratives, leading to great interest in clinical natural language processing (NLP). While a multitude of approaches to NLP exist, current algorithm development approaches have limitations that can slow the development process. These limitations are exacerbated when the task is emergent, as is the case currently for NLP extraction of signs and symptoms of COVID-19 and postacute sequelae of SARS-CoV-2 infection (PASC).

View Article and Find Full Text PDF

Importance: Buprenorphine significantly reduces opioid-related overdose mortality. From 2002 to 2022, the Drug Addiction Treatment Act of 2000 (DATA 2000) required qualified practitioners to receive a waiver from the Drug Enforcement Agency to prescribe buprenorphine for treatment of opioid use disorder. During this period, waiver uptake among practitioners was modest; subsequent changes need to be examined.

View Article and Find Full Text PDF
Article Synopsis
  • Housing instability significantly impacts health outcomes, particularly in individuals with substance use disorders (SUDs), necessitating comprehensive real-world evidence to enhance research.
  • A study analyzed electronic health records of over 20,000 patients from 2017 to 2021, revealing that traditional diagnostic codes identified only 58.5% of those with housing instability, while alternative methods (addresses and clinical notes) provided additional insights.
  • The findings emphasize the need for integrating both structured and unstructured data sources for understanding housing instability in SUD populations, due to variations in identification methods and the complex nature of patients' housing situations.
View Article and Find Full Text PDF

We present our open-source pipeline for quickly enhancing open data sets with research-focused expansions and show its effectiveness on a cornerstone open data set released by the Cook County government in Illinois. The City of Chicago and Cook County were both early adopters of open data portals and have made a wide variety of data available to the public; we focus on the medical examiner case archive which provides information about deaths recorded by Cook County's Office of the Medical Examiner, including overdoses invaluable to substance use disorder research. Our pipeline derives key variables from open data and links to other publicly available data sets in support of accelerating translational research on substance use disorders.

View Article and Find Full Text PDF

Introduction: Stay-at-home orders during the COVID-19 pandemic decreased population mobility to reduce SARS-CoV-2 infection rates. We empirically tested the hypothesis that this public health measure was associated with a higher likelihood of opioid- and stimulant-involved deaths occurring in homes located in Cook County, Illinois.

Methods: The stay-at-home period was from March 21, 2020 to May 30, 2020.

View Article and Find Full Text PDF

Social determinants of health (SDOH) play a critical role in the risk of harmful drug use. Examining SDOH as a means of differentiating populations with multiple co-occurring substance use disorders (SUDs) is particularly salient in the era of prevalent opioid and stimulant use known as the "Third Wave". This study uses electronic medical records (EMRs) from a safety net hospital system from 14,032 patients in Kentucky from 2017 to 2019 in order to 1) define three types of SUD cohorts with shared/unique risk factors, 2) identify patients with unstable housing using novel methods for EMRs and 3) link patients to their residential neighborhood to obtain quantitative perspective on social vulnerability.

View Article and Find Full Text PDF

Opioid Overdose Network is an effort to generalize and adapt an existing research data network, the Accrual to Clinical Trials (ACT) Network, to support design of trials for survivors of opioid overdoses presenting to emergency departments (ED). Four institutions (Medical University of South Carolina [MUSC], Dartmouth Medical School [DMS], University of Kentucky [UK], and University of California San Diego [UCSD]) worked to adapt the ACT network. The approach that was taken to enhance the ACT network focused on 4 activities: cloning and extending the ACT infrastructure, developing an e-phenotype and corresponding registry, developing portable natural language processing tools to enhance data capture, and developing automated documentation templates to enhance extended data capture.

View Article and Find Full Text PDF

In the past twenty years, the consumption of opioid medications has reached significant proportions, leading to a rise in drug misuse and abuse and increased opioid dependence and related fatalities. Thus, the purpose of this study was to determine whether there are pharmacovigilance signals of abuse, misuse, and dependence and their nature for the following prescription opioids: codeine, dihydrocodeine, fentanyl, oxycodone, pentazocine, and tramadol. Both the pharmacovigilance datasets EudraVigilance (EV) and the FDA Adverse Events Reporting System (FAERS) were analyzed to identify and describe possible misuse-/abuse-/dependence-related issues.

View Article and Find Full Text PDF

We present findings on using natural language processing to classify tobacco-related entries from problem lists found within patient's electronic health records. Problem lists describe health-related issues recorded during a patient's medical visit; these problems are typically followed up upon during subsequent visits and are updated for relevance or accuracy. The mechanics of problem lists vary across different electronic health record systems.

View Article and Find Full Text PDF

Background: The term "doctor and pharmacy shopping" colloquially describes patients with high multiple provider episodes (MPEs)-a threshold count of distinct prescribers and/or pharmacies involved in prescription fulfillment. Opioid-related MPEs are implicated in the global opioid crisis and heavily monitored by government databases such as U.S.

View Article and Find Full Text PDF

We present preliminary findings in extracting semantics from reference data generated by the United States Census Bureau. US Census reference data is based upon surveys designed to collect demographics and other socioeconomic factors by geographical regions. These data sets contain thousands of variables; this complexity makes the reference data difficult to learn, query, and integrate into analyses.

View Article and Find Full Text PDF

We detail the challenges and barriers in applying natural language processing techniques to a collection of medical examiner case investigation notes related to fatal opioid poisonings. Major advances in biomedical informatics have made natural language processing (NLP) of medical texts both a realistic and useful task. Biomedical NLP tools are typically designed to process documents originating from biomedical libraries or electronic health records (EHRs).

View Article and Find Full Text PDF

We present a collection of geodatabase functions which expedite utilizing differential privacy for privacy-aware geospatial analysis of healthcare data. The healthcare domain has a long history of standardization and research communities have developed open-source common data models to support the larger goals of interoperability, reproducibility, and data sharing; these models also standardize geospatial patient data. However, patient privacy laws and institutional regulations complicate geospatial analyses and dissemination of research findings due to protective restrictions in how data and results are shared.

View Article and Find Full Text PDF
Article Synopsis
  • - The
  • HEALing Communities Study (HCS)
  • aims to reduce opioid overdose deaths in 67 U.S. communities heavily impacted by opioid misuse through the
  • Communities That Heal (CTH)
  • intervention, which utilizes community-tailored dashboards to inform and guide decision-making.
  • - These
  • dashboards
  • are web-based tools featuring community-specific data on opioid-related metrics, developed collaboratively by HCS researchers and local stakeholders, to help monitor the effectiveness of implemented strategies.
  • - The study highlights how these dashboards can encourage
  • community-driven solutions
  • to tackle the opioid crisis and provides insights that can be applied to address other public health challenges affecting marginalized groups.
View Article and Find Full Text PDF

We present sig2db as an open-source solution for clinical data warehouses desiring to process natural language from prescription instructions, often referred to as "sigs". In electronic prescribing, the sig is typically an unstructured text field intended to capture all requirements for medication administration. The sig captures certain fields that the structured data may lack such as days supply, time of day, or meal-time considerations.

View Article and Find Full Text PDF

Geocoding, the process of translating addresses to geographic coordinates, is a relatively straight-forward and well-studied process, but limitations due to privacy concerns may restrict usage of geographic data. The impact of these limitations are further compounded by the scale of the data, and in turn, also limits viable geocoding strategies. For example, healthcare data is protected by patient privacy laws in addition to possible institutional regulations that restrict external transmission and sharing of data.

View Article and Find Full Text PDF

Drug repurposing is the identification of novel indication(s) for existing medications. Health claims data provide a burgeoning resource to evaluate pharmacotherapies with repurposing potential. To demonstrate a workflow for drug repurposing using claims data, we assessed the association between prescription of bupropion and stimulant use disorder (StUD) remission.

View Article and Find Full Text PDF
Modeling Terminologies for Reusability in Faceted Systems.

Proc IEEE Int Conf Inf Reuse Integr

August 2017

We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often curated externally and are available as a reusable resource across systems.

View Article and Find Full Text PDF

We present DELVE (Document ExpLoration and Visualization Engine), a framework for developing interactive visualizations as modular Web-applications to assist researchers with exploratory literature search. The goal for web-applications driven by DELVE is to better satisfy the information needs of researchers and to help explore and understand the state of research in scientific liter ature by providing immersive visualizations that both contain facets and are driven by facets derived from the literature. We base our framework on principles from user-centered design and human-computer interaction (HCI).

View Article and Find Full Text PDF

We demonstrate that closure tables are an effective data structure for developing database-driven applications that query biomedical ontologies and that require cross-querying between multiple ontologies. A closure table stores all available paths within a tree, even those without a direct parent-child relationship; additionally, a node can have multiple ancestors which gives the foundation for supporting linkages between controlled ontologies. We augment the meta-data structure of the ICD9 and ICD10 ontologies included in i2b2, an open source query tool for identifying patient cohorts, to utilize a closure table.

View Article and Find Full Text PDF

We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often externally curated and are available as a reusable resource across systems.

View Article and Find Full Text PDF

We demonstrate that the open-source i2b2 (Informatics for Integrating Biology and the Bedside) data model can be used to bootstrap rural health analytics and learning networks. These networks promote communication and research initiatives by providing the infrastructure necessary for sharing data and insights across a group of healthcare and research partners. Data integration remains a crucial challenge in connecting rural healthcare sites with a common data sharing and learning network due to the lack of interoperability and standards within electronic health records.

View Article and Find Full Text PDF