Automating Construction of Machine Learning Models With Clinical Big Data: Proposal Rationale and Methods.

Gang Luo, Bryan L Stone, Michael D Johnson, Peter Tarczy-Hornoch, Adam B Wilcox, Sean D Mooney, Xiaoming Sheng, Peter J Haug, Flory L Nkoy

JMIR Res Protoc

Department of Pediatrics, University of Utah, Salt Lake City, UT, United States.

Published: August 2017

Background: To improve health outcomes and cut health care costs, we often need to conduct prediction/classification using large clinical datasets (also known as clinical big data), for example, to identify high-risk patients for preventive interventions. Machine learning has been proposed as a key technology for doing this. It has won most data science competitions and could support many clinical activities, yet only 15% of hospitals use it for even limited purposes. Despite familiarity with data, health care researchers often lack the machine learning expertise to directly use clinical big data, which creates a hurdle in realizing value from their data. Health care researchers can work with data scientists who have deep machine learning knowledge, but it takes time and effort for both parties to communicate effectively. Facing a shortage of data scientists in the United States and hiring competition from companies with deep pockets, health care systems have difficulty recruiting data scientists. Building and generalizing a machine learning model often requires hundreds to thousands of manual iterations by data scientists to select the following: (1) complex algorithms and hyper-parameter values that greatly affect model accuracy and (2) operators and periods for temporally aggregating clinical attributes (eg, whether a patient's weight kept rising in the past year). This process becomes infeasible with limited budgets.
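
The temporal aggregation choice described in the Background (an operator plus a look-back period) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the function names, the operator set, and the sample weights are hypothetical and are not taken from Auto-ML.

```python
from datetime import date, timedelta
from statistics import mean


def values_in_period(records, as_of, period_days):
    """Return values recorded in the look-back period, in date order."""
    start = as_of - timedelta(days=period_days)
    in_window = sorted((d, v) for d, v in records if start <= d <= as_of)
    return [v for _, v in in_window]


def kept_rising(values):
    """True if there are at least two values and each exceeds the previous one."""
    return len(values) >= 2 and all(b > a for a, b in zip(values, values[1:]))


# Hypothetical operator set; a real system would offer many more choices.
OPERATORS = {"mean": mean, "max": max, "kept_rising": kept_rising}


def aggregate(records, as_of, operator, period_days):
    """Apply one (operator, period) pair to a patient's measurements."""
    values = values_in_period(records, as_of, period_days)
    if not values:
        return None  # no measurement in the period
    return OPERATORS[operator](values)


# Made-up weight measurements (kg) for one illustrative patient.
weights_kg = [
    (date(2016, 3, 1), 80.0),
    (date(2016, 9, 1), 82.5),
    (date(2017, 1, 15), 84.0),
    (date(2017, 6, 1), 86.0),
]
as_of = date(2017, 8, 1)

rising_past_year = aggregate(weights_kg, as_of, "kept_rising", 365)
max_past_90_days = aggregate(weights_kg, as_of, "max", 90)
```

Each (operator, period) pair yields a different candidate feature from the same raw attribute, which is why choosing among them by hand takes many iterations.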

Objective: This study's goal is to enable health care researchers to directly use clinical big data, make machine learning feasible with limited budgets and data scientist resources, and realize value from data.

Methods: This study will allow us to achieve the following: (1) finish developing the new software, Automated Machine Learning (Auto-ML), to automate model selection for machine learning with clinical big data and validate Auto-ML on seven benchmark modeling problems of clinical importance; (2) apply Auto-ML and novel methodology to two new modeling problems crucial for care management allocation and pilot one model with care managers; and (3) perform simulations to estimate the impact of adopting Auto-ML on US patient outcomes.
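
As a rough illustration of what automating model selection involves, the sketch below searches exhaustively over (algorithm, hyper-parameter value) pairs and keeps the pair with the best hold-out accuracy. It is a toy under stated assumptions: the two classifiers, the search space, and the data are hypothetical, and exhaustive search is used only for simplicity. The abstract does not describe Auto-ML's actual search strategy.

```python
def knn_predict(train, x, k):
    """Majority label among the k training points nearest to x (one feature)."""
    nearest = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    positives = sum(label for _, label in nearest)
    return 1 if 2 * positives > len(nearest) else 0


def stump_predict(train, x, threshold):
    """Predict 1 when the feature exceeds a fixed threshold (train is unused)."""
    return 1 if x > threshold else 0


# Hypothetical search space: algorithm name -> (predict function, candidate values).
SEARCH_SPACE = {
    "knn": (knn_predict, [1, 3, 5]),
    "stump": (stump_predict, [4.0, 6.0, 8.0]),
}


def accuracy(predict, param, train, valid):
    """Fraction of validation examples that are predicted correctly."""
    hits = sum(predict(train, x, param) == y for x, y in valid)
    return hits / len(valid)


def select_model(train, valid):
    """Try every (algorithm, value) pair; keep the first one with the top score."""
    best = None
    for name, (predict, params) in SEARCH_SPACE.items():
        for param in params:
            score = accuracy(predict, param, train, valid)
            if best is None or score > best[0]:
                best = (score, name, param)
    return best


# Made-up data: one numeric feature and a binary high-risk label.
train = [(1, 0), (2, 0), (3, 0), (5, 0), (7, 1), (8, 1), (9, 1), (10, 1)]
valid = [(2.5, 0), (4.5, 0), (6.5, 1), (9.5, 1)]

best_score, best_algorithm, best_value = select_model(train, valid)
```

Even this toy evaluates six configurations. With real algorithms, many hyper-parameters each, and large clinical datasets, the number of configurations grows quickly, which is the manual burden the study aims to remove.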

Results: We are currently writing Auto-ML's design document. We intend to finish our study by around the year 2022.

Conclusions: Auto-ML will generalize to various clinical prediction/classification problems. With minimal help from data scientists, health care researchers can use Auto-ML to quickly build high-quality models. This will boost wider use of machine learning in health care and improve patient outcomes.

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5596298	PMC
http://dx.doi.org/10.2196/resprot.7757	DOI Listing

Similar Publications

Multimodal Pain Recognition in Postoperative Patients: Machine Learning Approach.

JMIR Form Res

January 2025

Department of Computer Science, University of California, Irvine, Irvine, CA, United States.

Ajan Subramanian, Rui Cao, Emad Kasaeyan Naeini, Seyed Amir Hossein Aqajari, Thomas D Hughes

Background: Acute pain management is critical in postoperative care, especially in vulnerable patient populations that may be unable to self-report pain levels effectively. Current methods of pain assessment often rely on subjective patient reports or behavioral pain observation tools, which can lead to inconsistencies in pain management. Multimodal pain assessment, integrating physiological and behavioral data, presents an opportunity to create more objective and accurate pain measurement systems.

Cross-Cultural Sense-Making of Global Health Crises: A Text Mining Study of Public Opinions on Social Media Related to the COVID-19 Pandemic in Developed and Developing Economies.

J Med Internet Res

January 2025

Unitat de Recerca i Innovació, Gerència d'Atenció Primària i a la Comunitat de la Catalunya Central, Institut Català de la Salut, Sant Fruitós de Bages, Spain.

Adham Kahlawi, Firas Masri, Wasim Ahmed, Josep Vidal-Alaball

Background: The COVID-19 pandemic reshaped social dynamics, fostering reliance on social media for information, connection, and collective sense-making. Understanding how citizens navigate a global health crisis in varying cultural and economic contexts is crucial for effective crisis communication.

Objective: This study examines the evolution of citizen collective sense-making during the COVID-19 pandemic by analyzing social media discourse across Italy, the United Kingdom, and Egypt, representing diverse economic and cultural contexts.

How should the advancement of large language models affect the practice of science?

Proc Natl Acad Sci U S A

February 2025

Max Planck Institute for Biological Cybernetics, Tübingen, Baden-Württemberg 72076, Germany.

Marcel Binz, Stephan Alaniz, Adina Roskies, Balazs Aczel, Carl T Bergstrom

Large language models (LLMs) are being increasingly incorporated into scientific workflows. However, we have yet to fully grasp the implications of this integration. How should the advancement of large language models affect the practice of science? For this opinion piece, we have invited four diverse groups of scientists to reflect on this query, sharing their perspectives and engaging in debate.

Prediction of hip fracture by high-resolution peripheral quantitative computed tomography in older Swedish women.

J Bone Miner Res

January 2025

Sahlgrenska Osteoporosis Centre, Department of Internal Medicine and Clinical Nutrition, Institute of Medicine, University of Gothenburg, Gothenburg, Sweden.

Raju Jaiswal, Aldina Pivodic, Michail Zoulakis, Kristian F Axelsson, Henrik Litsne

The socioeconomic burden of hip fractures, the most severe osteoporotic fracture outcome, is increasing and the current clinical risk assessment lacks sensitivity. This study aimed to develop a method for improved prediction of hip fracture by incorporating measurements of bone microstructure and composition derived from high-resolution peripheral quantitative computed tomography (HR-pQCT). In a prospective cohort study of 3028 community-dwelling women aged 75 to 80, all participants answered questionnaires and underwent baseline examinations of anthropometrics and bone by dual x-ray absorptiometry (DXA) and HR-pQCT.

Alzheimer's disease image classification based on enhanced residual attention network.

PLoS One

January 2025

School of Emergency Management, Institute of Disaster Prevention, Sanhe, Hebei, China.

Xiaoli Li, Bairui Gong, Xinfang Chen, Hui Li, Guoming Yuan

With the increasing number of patients with Alzheimer's Disease (AD), the demand for early diagnosis and intervention is becoming increasingly urgent. The traditional detection methods for Alzheimer's disease mainly rely on clinical symptoms, biomarkers, and imaging examinations. However, these methods have limitations in the early detection of Alzheimer's disease, such as strong subjectivity in diagnostic criteria, high detection costs, and high misdiagnosis rates.
