Background: Data from multiple organizations are crucial for advancing learning health systems. However, ethical, legal, and social concerns may restrict the use of standard statistical methods that rely on pooling data. Although distributed algorithms offer alternatives, they may not always be suitable for health frameworks.

Objective: This paper aims to support researchers and data custodians in three ways: (1) providing a concise overview of the literature on statistical inference methods for horizontally partitioned data; (2) describing the methods applicable to generalized linear models (GLMs) and assessing their underlying distributional assumptions; and (3) adapting existing methods to make them fully usable in health settings.

Methods: A scoping review methodology was employed for the literature mapping, from which methods presenting a methodological framework for GLM analyses with horizontally partitioned data were identified and assessed from the perspective of applicability in health settings. Statistical theory was used to adapt methods and to derive the properties of the resulting estimators.

Results: From the review, 41 articles were selected, and six approaches were extracted for conducting standard GLM-based statistical analysis. However, these approaches assumed that node sample sizes were equal and that data were identically distributed across nodes. Consequently, statistical procedures were derived to accommodate uneven node sample sizes and heterogeneous data distributions across nodes. Workflows and detailed algorithms were developed to highlight information-sharing requirements and operational complexity.

Conclusions: This paper contributes to the field of health analytics by providing an overview of the methods that can be used with horizontally partitioned data, by adapting these methods to the context of heterogeneous health data and by clarifying the workflows and quantities exchanged by the methods discussed. Further analysis of the confidentiality preserved by these methods is needed to fully understand the risk associated with the sharing of summary statistics.
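
To make the idea of "quantities exchanged" concrete, the following is a minimal sketch of one well-known way to fit a logistic GLM on horizontally partitioned data: each node computes the gradient and Hessian of its own log-likelihood at the current coefficients, and a coordinator sums them and takes a Newton step. It is an illustration only and is not claimed to be one of the six approaches identified in the review. The function names (`node_summaries`, `solve`, `fit_distributed`) and the toy data are hypothetical.

```python
import math

def node_summaries(X, y, beta):
    """Run locally at one node: gradient and Hessian (information matrix)
    of the logistic log-likelihood at beta. Only these sums leave the node."""
    p = len(beta)
    grad = [0.0] * p
    hess = [[0.0] * p for _ in range(p)]
    for xi, yi in zip(X, y):
        eta = sum(b * v for b, v in zip(beta, xi))
        mu = 1.0 / (1.0 + math.exp(-eta))
        w = mu * (1.0 - mu)
        for j in range(p):
            grad[j] += (yi - mu) * xi[j]
            for k in range(p):
                hess[j][k] += w * xi[j] * xi[k]
    return grad, hess

def solve(A, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def fit_distributed(nodes, p, iters=25, tol=1e-10):
    """Coordinator: sum node-level gradients and Hessians, then take Newton steps."""
    beta = [0.0] * p
    for _ in range(iters):
        grad = [0.0] * p
        hess = [[0.0] * p for _ in range(p)]
        for X, y in nodes:
            g, h = node_summaries(X, y, beta)
            for j in range(p):
                grad[j] += g[j]
                for k in range(p):
                    hess[j][k] += h[j][k]
        step = solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
        if max(abs(s) for s in step) < tol:
            break
    return beta

# Toy data (made up): two nodes of unequal size, columns are [intercept, x].
XA = [[1.0, 0.5], [1.0, 1.0], [1.0, 1.5], [1.0, 2.0], [1.0, 2.5], [1.0, 3.0]]
yA = [0, 0, 1, 0, 1, 1]
XB = [[1.0, 1.0], [1.0, 2.0], [1.0, 3.5]]
yB = [0, 1, 0]

beta_dist = fit_distributed([(XA, yA), (XB, yB)], p=2)
beta_pool = fit_distributed([(XA + XB, yA + yB)], p=2)
```

Because the log-likelihood is a sum over observations, the summed node-level quantities equal those of the pooled data, so `beta_dist` and `beta_pool` agree up to floating-point error whatever the node sizes. The cost is several communication rounds, and the exchanged gradients and Hessians are themselves summary statistics whose disclosure risk would need to be assessed.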

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11617597
DOI: http://dx.doi.org/10.2196/53622
