When the data are stored in a distributed manner, direct applications of traditional statistical inference procedures are often prohibitive due to communication costs and privacy concerns. This paper develops and investigates two Communication-Efficient Accurate Statistical Estimators (CEASE), implemented through iterative algorithms for distributed optimization. In each iteration, node machines carry out computation in parallel and communicate with the central processor, which then broadcasts aggregated information to node machines for new updates. The algorithms adapt to the similarity among loss functions on node machines, and converge rapidly when each node machine has large enough sample size. Moreover, they do not require good initialization and enjoy linear converge guarantees under general conditions. The contraction rate of optimization errors is presented explicitly, with dependence on the local sample size unveiled. In addition, the improved statistical accuracy per iteration is derived. By regarding the proposed method as a multi-step statistical estimator, we show that statistical efficiency can be achieved in finite steps in typical statistical applications. In addition, we give the conditions under which the one-step CEASE estimator is statistically efficient. Extensive numerical experiments on both synthetic and real data validate the theoretical results and demonstrate the superior performance of our algorithms.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10281708 | PMC |
http://dx.doi.org/10.1080/01621459.2021.1969238 | DOI Listing |
Environ Geochem Health
January 2025
School of Environmental Science and Engineering, Shandong University, Qingdao, 266237, China.
Groundwater arsenic (As), contamination is a significant issue worldwide including China and Pakistan, particularly in canal command areas. In this study, 131 groundwater samples were collected, and three machine learning models [Random Forest (RF), Logistic Regression (LR), and Artificial Neural Network (ANN)] were employed to predict As concentration. Descriptive statistics helped to conclude that all of the samples were inside the permitted limit of WHO for pH, Ca, Mg, Turbidity, Cl, K, Na, SO, NO, F and beyond limit of WHO for EC, HCO, TDS, and As.
View Article and Find Full Text PDFBackground: Predictive biomarkers characterizing disease progression are called for in the context of emerging treatments for Alzheimer's disease. We implemented a link prediction model on morphometric correlation networks(MCN) generated from structural MRI.
Method: High-resolution T1MPRAGE images were retrospectively collected at two timepoints (interval 2.
Mol Inform
January 2025
Faculty of Information Technology, HUTECH University, 700000, Ho Chi Minh City, Vietnam.
In recent times, graph representation learning has been becoming a hot research topic which has attracted a lot of attention from researchers. Graph embeddings have diverse applications across fields such as information and social network analysis, bioinformatics and cheminformatics, natural language processing (NLP), and recommendation systems. Among the advanced deep learning (DL) based architectures used in graph representation learning, graph neural networks (GNNs) have emerged as the dominant and highly effective framework.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Power Electronics, Machines and Control (PEMC) Research Institute, University of Nottingham, 15 Triumph Rd, Lenton, Nottingham NG7 2GT, UK.
The accuracy of node localization plays a crucial role in the performance and reliability of wireless sensor networks (WSNs), which are widely utilized in fields like security systems and environmental monitoring. The integrity of these networks is often threatened by the presence of malicious nodes that can disrupt the localization process, leading to erroneous positioning and degraded network functionality. To address this challenge, we propose the security-aware localization using bat-optimized malicious anchor prediction (BO-MAP) algorithm.
View Article and Find Full Text PDFInt J Mol Sci
December 2024
Laboratory for Pathology Dynamics, Department of Biomedical Engineering, Georgia Institute of Technology & Emory University School of Medicine, Atlanta, GA 30322, USA.
The overlapping molecular pathophysiology of Alzheimer's Disease (AD), Amyotrophic Lateral Sclerosis (ALS), and Frontotemporal Dementia (FTD) was analyzed using relationships from a knowledge graph of 33+ million biomedical journal articles. The unsupervised learning rank aggregation algorithm from SemNet 2.0 compared the most important amino acid, peptide, and protein (AAPP) nodes connected to AD, ALS, or FTD.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!