AI Article Synopsis

Article Abstract

Background And Objectives: Missing data is a ubiquitous problem in longitudinal studies due to the number of patients lost to follow-up. Kernel methods have enriched the machine learning field by successfully managing non-vectorial predictors, such as graphs, strings, and probability distributions, and have emerged as a promising tool for the analysis of complex data stemming from modern healthcare. This paper proposes a new set of kernel methods to handle missing data in the response variables. These methods will be applied to predict long-term changes in glycated haemoglobin (A1c), the primary biomarker used to diagnose and monitor the progression of diabetes mellitus, making emphasis on exploring the predictive potential of continuous glucose monitoring (CGM).

Methods: We propose a new framework of non-linear kernel methods for testing statistical independence, selecting relevant predictors, and quantifying the uncertainty of the resultant predictive models. As a novelty in the clinical analysis, we used a distributional representation of CGM as a predictor and compared its performance with that of traditional diabetes biomarkers.

Results: The results show that, after the incorporation of CGM information, predictive ability increases from R=0.61 to R=0.71. In addition, uncertainty analysis is useful for characterising some subpopulations where predictivity is worsened, and a more personalised clinical follow-up is advisable according to expected patient uncertainty in glucose values.

Conclusions: The proposed methods have proven to deal effectively with missing data. They also have the potential to improve the results of predictive tasks by including new complex objects as explanatory variables and modelling arbitrary dependence relations. The application of these methods to a longitudinal study of diabetes showed that the inclusion of a distributional representation of CGM data provides greater sensitivity in predicting five-year A1c changes than classical diabetes biomarkers and traditional CGM metrics.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2022.106905DOI Listing

Publication Analysis

Top Keywords

missing data
12
kernel methods
12
machine learning
8
methods handle
8
handle missing
8
distributional representation
8
representation cgm
8
methods
7
data
5
kernel
4

Similar Publications

Surface water plays a vital role in the spread of infectious diseases. Information on the spatial and temporal dynamics of surface water availability is thus critical to understanding, monitoring and forecasting disease outbreaks. Before the launch of Sentinel-1 Synthetic Aperture Radar (SAR) missions, surface water availability has been captured at various spatial scales through approaches based on optical remote sensing data.

View Article and Find Full Text PDF

Instead of turning to emergency phone systems, social media platforms, such as Twitter, have emerged as alternative and sometimes preferred venues for members of the public in the US to communicate during hurricanes and other natural disasters. However, relevant posts are likely to be missed by responders given the volume of content on platforms. Previous work successfully identified relevant posts through machine-learned methods, but depended on human annotators.

View Article and Find Full Text PDF

Background: The World Health Organization (WHO) recommended cryptococcal antigen (CrAg) screening for people presenting with advanced HIV disease (AHD) and for those with positive CrAg without evidence of meningitis to initiate preemptive antifungal medication. Data on the implementation of WHO recommendations regarding CrAg screening is limited. We estimated pooled prevalence of CrAg screening uptake, cryptococcal antigenemia, lumbar puncture, cryptococcal meningitis and initiation of preemptive antifungal medication from available eligible published studies conducted in Africa.

View Article and Find Full Text PDF

This empirical study assessed the potential of developing a machine-learning model to identify children and adolescents with poor oral health using only self-reported survey data. Such a model could enable scalable and cost-effective screening and targeted interventions, optimizing limited resources to improve oral health outcomes. To train and test the model, we used data from 2,133 students attending schools in a Portuguese municipality.

View Article and Find Full Text PDF

Sugar-sweetened beverages (SSBs) and cigarettes are addictive substances and addictive substances are often related in consumption with each other. However, the potential interdependence between SSB and cigarette consumption has not been explored in the literature. As SSB and cigarette consumption have posed a great threat to individual health, the knowledge of such interdependence is critical for policymakers to design and coordinate government interventions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!