Background And Objectives: Missing data is a ubiquitous problem in longitudinal studies due to the number of patients lost to follow-up. Kernel methods have enriched the machine learning field by successfully managing non-vectorial predictors, such as graphs, strings, and probability distributions, and have emerged as a promising tool for the analysis of complex data stemming from modern healthcare. This paper proposes a new set of kernel methods to handle missing data in the response variables. These methods will be applied to predict long-term changes in glycated haemoglobin (A1c), the primary biomarker used to diagnose and monitor the progression of diabetes mellitus, making emphasis on exploring the predictive potential of continuous glucose monitoring (CGM).
Methods: We propose a new framework of non-linear kernel methods for testing statistical independence, selecting relevant predictors, and quantifying the uncertainty of the resultant predictive models. As a novelty in the clinical analysis, we used a distributional representation of CGM as a predictor and compared its performance with that of traditional diabetes biomarkers.
Results: The results show that, after the incorporation of CGM information, predictive ability increases from R=0.61 to R=0.71. In addition, uncertainty analysis is useful for characterising some subpopulations where predictivity is worsened, and a more personalised clinical follow-up is advisable according to expected patient uncertainty in glucose values.
Conclusions: The proposed methods have proven to deal effectively with missing data. They also have the potential to improve the results of predictive tasks by including new complex objects as explanatory variables and modelling arbitrary dependence relations. The application of these methods to a longitudinal study of diabetes showed that the inclusion of a distributional representation of CGM data provides greater sensitivity in predicting five-year A1c changes than classical diabetes biomarkers and traditional CGM metrics.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.cmpb.2022.106905 | DOI Listing |
PLoS One
January 2025
UK Centre for Ecology and Hydrology, Crowmarsh Gifford, Wallingford, United Kingdom.
Surface water plays a vital role in the spread of infectious diseases. Information on the spatial and temporal dynamics of surface water availability is thus critical to understanding, monitoring and forecasting disease outbreaks. Before the launch of Sentinel-1 Synthetic Aperture Radar (SAR) missions, surface water availability has been captured at various spatial scales through approaches based on optical remote sensing data.
View Article and Find Full Text PDFPLoS One
January 2025
Computational Media Lab, University of Texas at Austin, Austin, Texas, United States of America.
Instead of turning to emergency phone systems, social media platforms, such as Twitter, have emerged as alternative and sometimes preferred venues for members of the public in the US to communicate during hurricanes and other natural disasters. However, relevant posts are likely to be missed by responders given the volume of content on platforms. Previous work successfully identified relevant posts through machine-learned methods, but depended on human annotators.
View Article and Find Full Text PDFBackground: The World Health Organization (WHO) recommended cryptococcal antigen (CrAg) screening for people presenting with advanced HIV disease (AHD) and for those with positive CrAg without evidence of meningitis to initiate preemptive antifungal medication. Data on the implementation of WHO recommendations regarding CrAg screening is limited. We estimated pooled prevalence of CrAg screening uptake, cryptococcal antigenemia, lumbar puncture, cryptococcal meningitis and initiation of preemptive antifungal medication from available eligible published studies conducted in Africa.
View Article and Find Full Text PDFPLoS One
January 2025
Nova School of Business and Economics, Universidade Nova de Lisboa, Carcavelos, Portugal.
This empirical study assessed the potential of developing a machine-learning model to identify children and adolescents with poor oral health using only self-reported survey data. Such a model could enable scalable and cost-effective screening and targeted interventions, optimizing limited resources to improve oral health outcomes. To train and test the model, we used data from 2,133 students attending schools in a Portuguese municipality.
View Article and Find Full Text PDFPLoS One
January 2025
China Academy for Rural Development, Zhejiang University, Hangzhou, Zhejiang, China.
Sugar-sweetened beverages (SSBs) and cigarettes are addictive substances and addictive substances are often related in consumption with each other. However, the potential interdependence between SSB and cigarette consumption has not been explored in the literature. As SSB and cigarette consumption have posed a great threat to individual health, the knowledge of such interdependence is critical for policymakers to design and coordinate government interventions.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!