In recent years, distributed stochastic algorithms have become increasingly useful in the field of machine learning. However, similar to traditional stochastic algorithms, they face a challenge where achieving high fitness on the training set does not necessarily result in good performance on the test set. To address this issue, we propose to use of a distributed network topology to improve the generalization ability of the algorithms. We specifically focus on the Sharpness-Aware Minimization (SAM) algorithm, which relies on perturbation weights to find the maximum point with better generalization ability. In this paper, we present the decentralized stochastic sharpness-aware minimization (D-SSAM) algorithm, which incorporates the distributed network topology. We also provide sublinear convergence results for non-convex targets, which is comparable to consequence of Decentralized Stochastic Gradient Descent (DSGD). Finally, we empirically demonstrate the effectiveness of these results in deep networks and discuss their relationship to the generalization behavior of SAM.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2024.106325DOI Listing

Publication Analysis

Top Keywords

decentralized stochastic
12
sharpness-aware minimization
12
stochastic sharpness-aware
8
stochastic algorithms
8
distributed network
8
network topology
8
generalization ability
8
minimization algorithm
4
algorithm years
4
years distributed
4

Similar Publications

Differential equations are pivotal in modeling and understanding the dynamics of various systems, as they offer insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e.

View Article and Find Full Text PDF

Decentralized optimal H fusion estimation for multi-sensor networked systems with two-channel hybrid attacks.

ISA Trans

January 2025

School of Electronic Engineering, Heilongjiang University, Harbin 150080, China. Electronic address:

The article aims at the decentralized optimal H fusion estimation issue for multi-sensor networked systems with insecure network communications, where hybrid attacks consisting of stochastic deception and denial-of-service attacks happen on both the sensor-to-local filter channel and the local filter-to-fusion center channel simultaneously. Some random variables obeying Bernoulli distributions are utilized to depict the hybrid attacks existing in two classes of communication channels in a unified framework. Relying on a novel augmentation method, the fusion estimation error system with globally internal dynamics is obtained.

View Article and Find Full Text PDF

Unique ecology of biofilms and flocs: Bacterial composition, assembly, interaction, and nitrogen metabolism within deteriorated bioreactor inoculated with mature partial nitrification-anammox sludge.

Bioresour Technol

December 2024

Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education, China), School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China. Electronic address:

This work unraveled discrepant ecological patterns between biofilms and flocs in a deteriorated bioreactor inoculated with mature partial nitrification-anammox (PN/A) sludge. Based on 16S rRNA analysis, a comprehensive evaluation of neutral and null models, along with niche width, delineated that the bacterial community assembly in biofilms and flocs was dominantly driven by the stochastic process, and dispersal limitation critically shaped the community assembly. Co-occurrence network analysis revealed that environmental stress caused decentralized and fragmented bacterial colonies, and anammox bacteria were mainly peripheral in biofilms network and less involved in interspecific interactions.

View Article and Find Full Text PDF

Physical extraction of antigen and information.

Proc Natl Acad Sci U S A

September 2024

Department of Physics and Astronomy, University of California, Los Angeles, CA 90095.

To respond and adapt, cells use surface receptors to sense environmental cues. While biochemical signal processing inside the cell is studied in depth, less is known about how physical processes during cell-cell contact impact signal acquisition. New experiments found that fast-evolving immune B cells in germinal centers (GCs) apply force to acquire antigen clusters prior to internalization, suggesting adaptive benefits of physical information extraction.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!