Communication-Efficient Accurate Statistical Estimation.

J Am Stat Assoc

Department of IEOR, Columbia University.

Published: September 2021

When the data are stored in a distributed manner, direct applications of traditional statistical inference procedures are often prohibitive due to communication costs and privacy concerns. This paper develops and investigates two Communication-Efficient Accurate Statistical Estimators (CEASE), implemented through iterative algorithms for distributed optimization. In each iteration, node machines carry out computation in parallel and communicate with the central processor, which then broadcasts aggregated information to node machines for new updates. The algorithms adapt to the similarity among loss functions on node machines, and converge rapidly when each node machine has a large enough sample size. Moreover, they do not require good initialization and enjoy linear convergence guarantees under general conditions. The contraction rate of optimization errors is presented explicitly, with dependence on the local sample size unveiled. In addition, the improved statistical accuracy per iteration is derived. By regarding the proposed method as a multi-step statistical estimator, we show that statistical efficiency can be achieved in finite steps in typical statistical applications. We also give the conditions under which the one-step CEASE estimator is statistically efficient. Extensive numerical experiments on both synthetic and real data validate the theoretical results and demonstrate the superior performance of our algorithms.
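The communication pattern described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the least-squares loss, the gradient-corrected and proximally regularized local update, the parameter `alpha`, the equal shard sizes, and the names `local_gradient` and `distributed_fit` are all assumptions chosen for illustration. In each round the nodes send local gradients to the center, the center broadcasts their average, each node solves a corrected local problem, and the center averages the local solutions.

```python
import numpy as np

def local_gradient(X, y, theta):
    """Gradient of the local least-squares loss (1/2n)||X theta - y||^2."""
    return X.T @ (X @ theta - y) / len(y)

def distributed_fit(shards, n_rounds=20, alpha=0.1):
    """Illustrative multi-round distributed estimator (assumed form, see lead-in).

    shards: list of (X_k, y_k) pairs, one per node, assumed equal in size.
    alpha:  hypothetical proximal regularization parameter.
    """
    d = shards[0][0].shape[1]
    theta = np.zeros(d)  # deliberately uninformative starting point
    for _ in range(n_rounds):
        # Nodes compute local gradients in parallel; the center averages
        # them (the global gradient for equal shard sizes) and broadcasts.
        g = np.mean([local_gradient(X, y, theta) for X, y in shards], axis=0)
        # Each node minimizes its gradient-corrected, regularized local loss.
        # For least squares this has the closed form
        #   theta_k = theta - (H_k + alpha * I)^{-1} g,
        # where H_k is the local Hessian.
        local_solutions = []
        for X, y in shards:
            H_k = X.T @ X / len(y)
            step = np.linalg.solve(H_k + alpha * np.eye(d), g)
            local_solutions.append(theta - step)
        # The center aggregates the local solutions by averaging.
        theta = np.mean(local_solutions, axis=0)
    return theta

# Synthetic check: 10 nodes with 500 observations each.
rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 5))
theta_true = np.arange(1.0, 6.0)
y = X @ theta_true + rng.normal(size=5000)
shards = list(zip(np.array_split(X, 10), np.array_split(y, 10)))

theta_hat = distributed_fit(shards)
theta_ols = np.linalg.lstsq(X, y, rcond=None)[0]  # pooled-data benchmark
```

Under these assumptions each round exchanges only vectors of length `d` between a node and the center, and the iterate approaches the pooled least-squares solution `theta_ols` when the local Hessians are close to the global one, which is the regime of large local sample size that the abstract refers to.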

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10281708
DOI: http://dx.doi.org/10.1080/01621459.2021.1969238

Publication Analysis

Top Keywords

node machines

communication-efficient accurate

accurate statistical

sample size

statistical

statistical estimation

estimation data

data stored

stored distributed

distributed manner

Similar Publications

Assessment of groundwater chemistry to predict arsenic contamination from a canal commanded area: applications of different machine learning models.

Environ Geochem Health

January 2025

School of Environmental Science and Engineering, Shandong University, Qingdao, 266237, China.

Fazila Younas Muhammad Fahad Sardar Zahid Ullah Jawad Ali Xiaona Yu

Groundwater arsenic (As) contamination is a significant issue worldwide, including in China and Pakistan, particularly in canal command areas. In this study, 131 groundwater samples were collected, and three machine learning models [Random Forest (RF), Logistic Regression (LR), and Artificial Neural Network (ANN)] were employed to predict As concentration. Descriptive statistics showed that all of the samples were within the WHO permitted limits for pH, Ca, Mg, turbidity, Cl, K, Na, SO₄, NO₃, and F, and beyond the WHO limits for EC, HCO₃, TDS, and As.

Developing Topics.

Alzheimers Dement

December 2024

UCLA, Los Angeles, CA, USA.

Tiffany Luo Chencai Wang Benjamin M Ellingson Ceylan Z Cankurtaran

Background: Predictive biomarkers characterizing disease progression are called for in the context of emerging treatments for Alzheimer's disease. We implemented a link prediction model on morphometric correlation networks (MCN) generated from structural MRI.

Method: High-resolution T1MPRAGE images were retrospectively collected at two timepoints (interval 2.

A Topology-Enhanced Multi-Viewed Contrastive Approach for Molecular Graph Representation Learning and Classification.

Mol Inform

January 2025

Faculty of Information Technology, HUTECH University, 700000, Ho Chi Minh City, Vietnam.

Phu Pham

In recent times, graph representation learning has become an active research topic that has attracted considerable attention from researchers. Graph embeddings have diverse applications across fields such as information and social network analysis, bioinformatics and cheminformatics, natural language processing (NLP), and recommendation systems. Among the advanced deep learning (DL) based architectures used in graph representation learning, graph neural networks (GNNs) have emerged as the dominant and highly effective framework.

Enhanced Localization in Wireless Sensor Networks Using a Bat-Optimized Malicious Anchor Node Prediction Algorithm.

Sensors (Basel)

December 2024

Power Electronics, Machines and Control (PEMC) Research Institute, University of Nottingham, 15 Triumph Rd, Lenton, Nottingham NG7 2GT, UK.

Balachandran Nair Premakumari Sreeja Gopikrishnan Sundaram Marco Rivera Patrick Wheeler

The accuracy of node localization plays a crucial role in the performance and reliability of wireless sensor networks (WSNs), which are widely utilized in fields like security systems and environmental monitoring. The integrity of these networks is often threatened by the presence of malicious nodes that can disrupt the localization process, leading to erroneous positioning and degraded network functionality. To address this challenge, we propose the security-aware localization using bat-optimized malicious anchor prediction (BO-MAP) algorithm.

Artificial Intelligence-Assisted Comparative Analysis of the Overlapping Molecular Pathophysiology of Alzheimer's Disease, Amyotrophic Lateral Sclerosis, and Frontotemporal Dementia.

Int J Mol Sci

December 2024

Laboratory for Pathology Dynamics, Department of Biomedical Engineering, Georgia Institute of Technology & Emory University School of Medicine, Atlanta, GA 30322, USA.

Zihan Wei Meghna R Iyer Benjamin Zhao Jennifer Deng Cassie S Mitchell

The overlapping molecular pathophysiology of Alzheimer's Disease (AD), Amyotrophic Lateral Sclerosis (ALS), and Frontotemporal Dementia (FTD) was analyzed using relationships from a knowledge graph of 33+ million biomedical journal articles. The unsupervised learning rank aggregation algorithm from SemNet 2.0 compared the most important amino acid, peptide, and protein (AAPP) nodes connected to AD, ALS, or FTD.
