This paper develops an incremental learning algorithm based on quadratic inference function (QIF) to analyze streaming datasets with correlated outcomes such as longitudinal data and clustered data. We propose a renewable QIF (RenewQIF) method within a paradigm of renewable estimation and incremental inference, in which parameter estimates are recursively renewed with current data and summary statistics of historical data, but with no use of any historical subject-level raw data. We compare our renewable estimation method with both offline QIF and offline generalized estimating equations (GEE) approach that process the entire cumulative subject-level data all together, and show theoretically and numerically that our renewable procedure enjoys statistical and computational efficiency. We also propose an approach to diagnose the homogeneity assumption of regression coefficients via a sequential goodness-of-fit test as a screening procedure on occurrences of abnormal data batches. We implement the proposed methodology by expanding existing Spark's Lambda architecture for the operation of statistical inference and data quality diagnosis. We illustrate the proposed methodology by extensive simulation studies and an analysis of streaming car crash datasets from the National Automotive Sampling System-Crashworthiness Data System (NASS CDS). The supplementary material is available online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10530766PMC
http://dx.doi.org/10.1080/01621459.2022.2026778DOI Listing

Publication Analysis

Top Keywords

data
11
analysis streaming
8
clustered data
8
abnormal data
8
data batches
8
renewable estimation
8
proposed methodology
8
real-time regression
4
regression analysis
4
streaming clustered
4

Similar Publications

Prognostic Value of Frailty in Patients With Takotsubo Cardiomyopathy.

Clin Cardiol

January 2025

Unidad de Revisiones Sistemáticas y Meta-análisis (URSIGET), Vicerrectorado de Investigación, Universidad San Ignacio de Loyola, Lima, Peru.

Background: There is scarce data on the prognostic value of frailty in patients with Takotsubo cardiomyopathy (TCM). This study aimed to assess the association between frailty and in-hospital outcomes in patients with TCM.

Methods: Adult admissions with TCM were included using the 2016-2019 National Inpatient Sample database.

View Article and Find Full Text PDF

Double bond (C═C) position isomerism in unsaturated lipids can indicate abnormal lipid metabolism and pathological conditions. Novel chemical derivatization and mass spectrometry-based techniques are under continuing development to provide more accurate elucidation of lipid structure in finer structural detail. Here, we introduce a new ion chemistry for annotating lipid C═C positions, which is highly efficient for liquid chromatography-mass spectrometry-based lipidomics.

View Article and Find Full Text PDF

This study evaluated the effects of a critical reflection program utilizing the Lasater Clinical Judgment Rubric (LCJR) reflective questions based on the Clinical Judgment Model (CJM) on newly graduated nurses' clinical judgment skills. A total of 153 newly graduated nurses scheduled for on-site training in a ward nursing unit were divided into a control group (receiving only the usual on-site training with preceptorship) and an experimental group (receiving the developed program with the same on-site training with preceptorship as the control group). Data were collected at baseline, 6 weeks, and 3 months after the intervention.

View Article and Find Full Text PDF

Objective: To develop and validate a nomogram model for predicting central venous catheter-related infections (CRI) in patients with maintenance hemodialysis (MHD).

Methods: MHD patients with central venous catheters (CVCs) visiting the outpatient hemodialysis (HD) center of Xuzhou Medical University Affiliated Hospital from January 2020 to December 2023 were retrospectively selected through a HD monitoring system. Patient data were collected, and the patients were divided into training and validation sets in a 7:3 ratio.

View Article and Find Full Text PDF

Continuous Near-Bed Movements of Microplastics in Open Channel Flows: Statistical Analysis.

Environ Sci Technol

January 2025

Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.

The ubiquitous distribution of microplastics (MPs) in aquatic environments is linked to their transport in rivers and streams. However, the specific mechanism of bedload microplastic (MP) transport, notably their stochastic behaviors, remains an underexplored area. To investigate this, particle tracking velocimetry was employed to examine the continuous near-bed movements of four types of MPs under nine setups with different experimental conditions in a laboratory flume, with an emphasis on their streamwise transport.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!