An introduction to machine learning for classification and prediction.

Fam Pract

Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.

Published: February 2023

Classification and prediction tasks are common in health research. With the increasing availability of vast health data repositories (e.g. electronic medical record databases) and advances in computing power, traditional statistical approaches are being augmented or replaced with machine learning (ML) approaches to classify and predict health outcomes. ML describes the automated process of identifying ("learning") patterns in data to perform tasks. Developing an ML model includes selecting between many ML models (e.g. decision trees, support vector machines, neural networks); model specifications such as hyperparameter tuning; and evaluation of model performance. This process is conducted repeatedly to find the model and corresponding specifications that optimize some measure of model performance. ML models can make more accurate classifications and predictions than their statistical counterparts and confer greater flexibility when modelling unstructured data or interactions between covariates; however, many ML models require larger sample sizes to achieve good classification or predictive performance and have been criticized as "black box" for their poor transparency and interpretability. ML holds potential in family medicine for risk profiling of patients' disease risk and clinical decision support to present additional information at times of uncertainty or high demand. In the future, ML approaches are positioned to become commonplace in family medicine. As such, it is important to understand the objectives that can be addressed using ML approaches and the associated techniques and limitations. This article provides a brief introduction into the use of ML approaches for classification and prediction tasks in family medicine.

Download full-text PDF

Source
http://dx.doi.org/10.1093/fampra/cmac104DOI Listing

Publication Analysis

Top Keywords

classification prediction
12
family medicine
12
machine learning
8
prediction tasks
8
model performance
8
approaches
5
model
5
introduction machine
4
classification
4
learning classification
4

Similar Publications

Background: We investigated the rational extent of regional lymphadenectomy and evaluated the prognostic impact of number-based regional nodal classification in patients with distal cholangiocarcinoma.

Methods: This study included 191 patients with distal cholangiocarcinoma who underwent pancreaticoduodenectomy. The nos.

View Article and Find Full Text PDF

Background And Objectives: Chronic kidney disease (CKD) is known to be associated with increased plasma phosphorylated tau217 (p-tau217) concentrations, potentially confounding the utility of plasma p-tau217 measurements as a marker of amyloid pathology in individuals with suspected Alzheimer disease (AD). In this study, we quantitatively investigate the relationship of plasma p-tau217 concentrations vs estimated glomerular filtration rate (eGFR) in individuals with CKD with and without amyloid pathology.

Methods: This was a retrospective examination of data from 2 observational cohorts from either the Mayo Clinic Study of Aging or the Alzheimer's Disease Research Center cohorts.

View Article and Find Full Text PDF

Composition and functional diversity of soil and water microbial communities in the rice-crab symbiosis system.

PLoS One

January 2025

Department of Earth and Environmental Sciences, California State University, Fresno, CA, United States of America.

Rice-crab co-culture is an environmentally friendly agricultural and aquaculture technology with high economic and ecological value. In order to clarify the structure and function of soil and water microbial communities in the rice-crab symbiosis system, the standard rice-crab field with a ring groove was used as the research object. High-throughput sequencing was performed with rice field water samples to analyze the species and abundance differences of soil bacteria and fungi.

View Article and Find Full Text PDF

We explore the efficacy of multimodal behavioral cues for explainable prediction of personality and interview-specific traits. We utilize elementary head-motion units named kinemes, atomic facial movements termed action units and speech features to estimate these human-centered traits. Empirical results confirm that kinemes and action units enable discovery of multiple trait-specific behaviors while also enabling explainability in support of the predictions.

View Article and Find Full Text PDF

Survival analysis is critical in many fields, particularly in healthcare where it can guide medical decisions. Conventional survival analysis methods like Kaplan-Meier and Cox proportional hazards models to generate survival curves indicating probability of survival v. time have limitations, especially for long-term prediction, due to assumptions that all instances follow a general population-level survival curve.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!