Multiscale optimization in neural nets.

IEEE Trans Neural Netw

Dept. of Comput. Sci., Yale Univ., New Haven, CT.

Published: October 2012

One way to speed up convergence in a large optimization problem is to introduce a smaller, approximate version of the problem at a coarser scale and to alternate between relaxation steps for the fine-scale and coarse-scale problems. Such an optimization method for neural networks governed by quite general objective functions is presented. At the coarse scale, there is a smaller approximating neural net which, like the original net, is nonlinear and has a nonquadratic objective function. The transitions and information flow from fine to coarse scale and back do not disrupt the optimization, and the user need only specify a partition of the original fine-scale variables. Thus, the method can be applied easily to many problems and networks. There is generally about a fivefold improvement in estimated cost under the multiscale method. In the networks to which it was applied, a nontrivial speedup by a constant factor of between two and five was observed, independent of problem size. Further improvements in computational cost are very likely to be available, especially for problem-specific multiscale neural net methods.

Download full-text PDF

Source
http://dx.doi.org/10.1109/72.80337DOI Listing

Publication Analysis

Top Keywords

coarse scale
8
neural net
8
multiscale optimization
4
neural
4
optimization neural
4
neural nets
4
nets speed
4
speed convergence
4
convergence large
4
large optimization
4

Similar Publications

Scarce feature points are a critical limitation affecting the accuracy and stability of incremental structure from motion (SfM) in small-scale scenes. In this paper, we propose an incremental SfM method for small-scale scenes, combined with an auxiliary calibration plate. This approach increases the number of feature points in sparse regions, and we randomly generate feature points within those areas.

View Article and Find Full Text PDF

Existing autonomous driving systems face challenges in accurately capturing drivers' cognitive states, often resulting in decisions misaligned with drivers' intentions. To address this limitation, this study introduces a pioneering human-centric spatial cognition detecting system based on drivers' electroencephalogram (EEG) signals. Unlike conventional EEG-based systems that focus on intention recognition or hazard perception, the proposed system can further extract drivers' spatial cognition across two dimensions: relative distance and relative orientation.

View Article and Find Full Text PDF

Despite their widespread adoption, particle-scale simulation methods, such as the Discrete Element Method (DEM), for electrically charged particles in several natural processes and industrial transformations do not include realistic polarization effects. At close distances, these can dominate the particle motion and are impossible to predict by the commonly adopted Coulomb point-charge approximation. Sophisticated mathematical tools can account for uneven charge distributions, predicting an attractive force between a charged particle and a neutral particle or possible attraction between two like-charged particles.

View Article and Find Full Text PDF

Implementation of Time-Averaged Restraints with UNRES Coarse-Grained Model of Polypeptide Chains.

J Chem Theory Comput

January 2025

Faculty of Chemistry, University of Gdańsk, Fahrenheit Union of Universities, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland.

Time-averaged restraints from nuclear magnetic resonance (NMR) measurements have been implemented in the UNRES coarse-grained model of polypeptide chains in order to develop a tool for data-assisted modeling of the conformational ensembles of multistate proteins, intrinsically disordered proteins (IDPs) and proteins with intrinsically disordered regions (IDRs), many of which are essential in cell biology. A numerically stable variant of molecular dynamics with time-averaged restraints has been introduced, in which the total energy is conserved in sections of a trajectory in microcanonical runs, the bath temperature is maintained in canonical runs, and the time-average-restraint-force components are scaled up with the length of the memory window so that the restraints affect the simulated structures. The new approach restores the conformational ensembles used to generate ensemble-averaged distances, as demonstrated with synthetic restraints.

View Article and Find Full Text PDF

The assembly of proteins in membranes plays a key role in many crucial cellular pathways. Despite their importance, characterizing transmembrane assembly remains challenging for experiments and simulations. Equilibrium molecular dynamics simulations do not cover the time scales required to sample the typical transmembrane assembly.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!