Health care Big Data studies hold substantial promise for improving clinical practice. Among analytic tools, machine learning (ML) is an important approach that has been widely used by many industries for data-driven decision support. In Big Data, thousands of variables and millions of patient records are commonly encountered, but most data elements cannot be directly used to support decision making. Although many feature-selection tools can help identify relevant data, these tools are typically insufficient to determine a patient data cohort to support learning. Therefore, domain experts with nursing or clinic knowledge play critical roles in determining value criteria or the type of variables that should be included in the patient cohort to maximize project success. We demonstrate this process by extracting a patient cohort (37,506 individuals) to support our ML work (i.e., the production of a proactive strategy to prevent statin adverse events) from 130 million de-identified lives in the OptumLabs™ Data Warehouse.

Download full-text PDF

Source
http://dx.doi.org/10.1177/0193945916673059DOI Listing

Publication Analysis

Top Keywords

big data
12
data cohort
8
machine learning
8
patient cohort
8
data
6
cohort
4
cohort extraction
4
extraction facilitate
4
facilitate machine
4
learning improve
4

Similar Publications

Rapid growth in bio-logging-the use of animal-borne electronic tags to document the movements, behaviour, physiology and environments of wildlife-offers opportunities to mitigate biodiversity threats and expand digital natural history archives. Here we present a vision to achieve such benefits by accounting for the heterogeneity inherent to bio-logging data and the concerns of those who collect and use them. First, we can enable data integration through standard vocabularies, transfer protocols and aggregation protocols, and drive their wide adoption.

View Article and Find Full Text PDF

Background: In-person interaction offers invaluable benefits to people. To guarantee safe in-person activities during a COVID-19 outbreak, effective identification of infectious individuals is essential. In this study, we aim to analyze the impact of screening with antigen tests in schools and workplaces on identifying COVID-19 infections.

View Article and Find Full Text PDF

Background: Coronavirus disease-2019 (COVID-19), caused by SARS-CoV-2 virus infection, is characterized as a multisystem disease, potentially yielding multifaceted consequences on various organs at multiple levels. At the end of 2022, over 90% of the Chinese population was infected by SARS-CoV-2 within 35 days because of adjustments to epidemic prevention and control policies. This short-term change provides an unprecedented opportunity for comparative studies on COVID-19 infection among large populations.

View Article and Find Full Text PDF

Integrating crowdsourced data in the built environment studies: A systematic review.

J Environ Manage

January 2025

Department of Landscape Architecture, University of Nevada, Las Vegas, NV, USA. Electronic address:

The integration of crowdsourced data has become central to contemporary built environment studies, driven by the rapid growth in digital technologies and participatory approaches that characterize modern urbanism. Despite its potential, a systematic framework for its analysis remains underdeveloped. This review, conducted in accordance with the PRISMA protocol, examines the use of crowdsourced data in shaping the built environment, scrutinizing its applications, crowdsourcing techniques, methodologies, and comparison with other big data forms.

View Article and Find Full Text PDF

Background: The pervasiveness of drug culture has become evident in popular music and social media. Previous research has examined drug abuse content in both social media and popular music; however, to our knowledge, the intersection of drug abuse content in these 2 domains has not been explored. To address the ongoing drug epidemic, we analyzed drug-related content on Twitter (subsequently rebranded X), with a specific focus on lyrics.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!