Scalable Electronic Phenotyping For Studying Patient Comorbidities.

AMIA Annu Symp Proc

Biomedical Informatics Training Program, Stanford University, Stanford, CA.

Published: November 2019

Over 75 million Americans have multiple concurrent chronic conditions and medical decision making for these patients is mostly based on retrospective cohort studies. Current methods to generate cohorts of patients with comorbidities are neither scalable nor generalizable. We propose a supervised machine learning algorithm for learning comorbidity phenotypes without requiring manually created training sets. First, we generated myocardial infarction (MI) and type-2 diabetes (T2DM) patient cohorts using ICD9-based imperfectly labeled samples upon which LASSO logistic regression models were trained. Second, we assessed the effects of training sample size, inclusion of physician input, and inclusion of clinical text features on model performance. Using ICD9 codes as our labeling heuristic, we achieved comparable performance to models created using keywords as labeling heuristic. We found that expert input and higher training sample sizes could compensate for the lack of clinical text derived features. However, our best performing model included clinical text as features with a large training sample size.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6371288PMC

Publication Analysis

Top Keywords

training sample
12
clinical text
12
sample size
8
text features
8
labeling heuristic
8
scalable electronic
4
electronic phenotyping
4
phenotyping studying
4
studying patient
4
patient comorbidities
4

Similar Publications

Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments.

View Article and Find Full Text PDF

Eutrophication is one of the most relevant concerns due to the risk to water supply and food security. Nitrogen and phosphorus chemical species concentrations determined the risk and magnitude of eutrophication. These analyses are even more relevant in basins with intensive agriculture due to agrochemical discharges.

View Article and Find Full Text PDF

Objective: To provide evidence that catastrophizing is the primer of the cognitive-behavioural model of fear of movement/(re)injury (FAM).

Design: A cross-sectional analysis of 180 outpatients with chronic non-specific low back pain who completed the Pain Catastrophizing Scale (PCS), the Tampa Scale of Kinesiophobia (TSK), the Roland-Morris Disability Questionnaire (RMDQ), the Hospital Anxiety and Depression Scale - Depression (HADS-D), and a pain intensity numerical rating scale (NRS). The intercorrelations of the outcome measures were estimated using Pearson's correlation coefficient (r), and regression analyses were used to examine their predictive values by following the left side of the FAM clockwise from the PCS (p = 0.

View Article and Find Full Text PDF

Exercising regularly promotes health, but these benefits are complicated by acute inflammation induced by exercise. A potential source of inflammation is cell-free DNA (cfDNA), yet the cellular origins, molecular causes, and immune system interactions of exercise-induced cfDNA are unclear. To study these, 10 healthy individuals were randomized to a 12-wk exercise program of either high-intensity tactical training (HITT) or traditional moderate-intensity training (TRAD).

View Article and Find Full Text PDF

Purpose: The Daily Phonotrauma Index (DPI) can quantify pathophysiological mechanisms associated with daily voice use in individuals with phonotraumatic vocal hyperfunction (PVH). Since DPI was developed based on weeklong ambulatory voice monitoring, this study investigated if DPI can achieve comparable performance using (a) short laboratory speech tasks and (b) fewer than 7 days of ambulatory data.

Method: An ambulatory voice monitoring system recorded the vocal function/behavior of 134 females with PVH and vocally healthy matched controls in two different conditions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!